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COGNITIVE DISSONANCE: 


FIVE YEARS LATER* 


NATALIA P. CHAPANIS anp ALPHONSE CHAPANIS 
Johns Hopkins University 


This article reviews critically the experimental evidence in support of 
cognitive dissonance theory as applied to complex social events. The 
criticisms which can be made of this literature fall into 2 main classes. 
lst, the experimental manipulations are usually so complex and the 
crucial variables so confounded that no valid conclusions can be drawn 
from the data. 2nd, a number of fundamental methodological inadequa- 
cies in the analysis of results—as, e.g., rejection of cases and faulty 
statistical analysis of the data—vitiate the findings. As a result, one 
can only say that the evidence adduced for cognitive dissonance theory 
is inconclusive. Suggestions are offered for the methodological improve- 
ment of studies in this area. The review concludes with the thesis that 
the most attractive feature of cognitive dissonance theory, its simplicity, 
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is in actual fact a self-defeating limitation. 


Social psychologists have been trying 
for many years to predict the conditions 
under which attitudes and opinions are 
changed. In general their attempts have 
not been conspicuously successful. One 
of the first major breakthroughs in this 
area came when Leon Festinger (1957) 
published his book on A Theory of 


1 This article is based on material prepared 
by the senior author while she was employed 
at the Tavistock Institute of Human Relations, 
London, England. It appeared originally as 
Tavistock Document No. 626, “Cognitive Dis- 
sonance: A Dissenting Voice,” dated June 
1961. That work was part of a research pro- 
gram financed by S. H. Benson, Limited, and 
carried out by the Tavistock Institute. The ex- 
tensive expansion and revision of the original 
document undertaken with the collaboration 
of the junior author was supported in part by 
Contract Nonr 248(55) between the Office of 
Naval Research and the Johns Hopkins Uni- 
versity, We are pleased to acknowledge the 
assistance provided us by both organizations. 
The views expressed in this article are, how- 
Cver, those of the authors. Neither S. H. 

enson, Limited, nor the Office of Naval Re- 


Search is responsible for any of the statements 
contained in it, 


Cognitive Dissonance. In this book the 
author presented a simple conceptual 
scheme by which he could predict with 
precision the outcomes of certain social 
situations. To support his theory, 
Festinger marshaled data from an im- 
pressive variety of field and experimental 
studies. In addition, he and other work- 
ers have since then conducted a num- 
ber of studies designed to test specific 
derivations of the theory. What can we 
say about all this literature? 

Cognitive dissonance theory has al- 
ready been reviewed by Bruner (1957), 
Asch (1958), Osgood (1960), and 
Zajonc (1960). These writers, however, 
have been primarily concerned with a 


We are greatly indebted to the staff of the 
Tavistock Institute, and particularly to Fred- 
erick E. Emery, for many helpful comments, 
guidance, and encouragement throughout the 
preparation of the original document. Grate- 
ful appreciation is also extended to our many 
American colleagues—too numerous to mame 
individually—who read various versions of 
this article and offered helpful comments and 
encouragement. 
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critical evaluation of the conceptual sys- 
tem employed in dissonance theory. And, 
whatever they might think of the theory, 
most workers (except perhaps Asch) 
have been impressed by the scope, 
relevance, and ingenuity of the experi- 
mental evidence gathered in its support. 

There is an engaging simplicity about 

Festinger’s dissonance formulations. No 
matter how complex the social situation, 
Festinger assumes that it is possible to 
represent the meaning which the situa- 
tion has for an individual by a series of 
elementary cognitions—statements that 
an individual might make describing his 
“knowledge, opinions or beliefs” (Fes- 
tinger, 1957, p. 3). Moreover, a simple 
inventory of a group of related cogni- 
tions is sufficient to reveal whether or 
not they are consistent. The theory as- 
sumes further that people prefer con- 
sistency among their cognitions and that 
they will initiate change in order to 
preserve this consistency. So far these 
ideas are not new. They had been pro- 
mulgated as early as 1946 by Heider 
with his concept of balance and im- 
balance. The magic of Festinger’s 
theory, however, seems to lie in the ease 
with which imponderably complex social 
situations are reduced to simple state- 
ments, most often just two such state- 
ments. This having been done, a simple 
inspection for rational consistency is 
enough to predict whether or not change 
will occur. Such uncomplicated ration- 
ality seems especially welcome after hav- 
ing been told for years that our attitudes 
and resulting behavior are strongly de- 
pendent on motivational, emotional, 
affective, and perceptual processes (€g. 
Krech & Crutchfield, 1948; Rosenberg, 
1960). 

Five years have now elapsed since the 
publication of Festinger’s book, and this 
seems to be an appropriate time to pause 
for a close look at the evidence in sup- 
port of the theory. For no matter how 


appealing a theory might be, in the final 
analysis it is the evidence that counts. 
This paper, therefore, will be concerned 
with a review of experiments on cogni- 
tive dissonance in humans from two 
points of view. First we shall consider 
whether an experimenter really did what 
he said he did. Then later we shall con- 
sider whether the experimenter really 
got the results he said he did. 


CONTROVERSIAL EXPERIMENTAL 
MANIPULATIONS 


As we all know, good experimental 
work always involves manipulating con- 
ditions in such `a way that we may 
ascribe changes we observe in our de- 
pendent variables to the manipulations 
we carried out on the independent vari- 
ables. In actual practice we rarely define 
these manipulations in careful opera- 
tional terms. When a pellet drops into 
a cup in front of a hungry rat we call 
it a reward, or reinforcement; when a 
wire transmits an electric shock to a 
person we call it punishment, or stress; 
and so on. Moreover, we do not, in gen- 
eral, quarrel with our fellow experi- 
menter’s interpretation of the situation. 
After all, he was there, he ought to know 
what it was about. However, when we 
deal with experiments on cognitive dis- 
sonance we have a very special problem 
on our hands. 


Experimental Dissonance 


Simply stated, cognitive dissonance 
theory is concerned with what happens 
when the cognitions of a person are dis- 
crepant. The basic premise is that dis- 
crepant cognitions create tension which 
the individual strives to reduce by mak- 
ing his cognitions more consistent. This 
tension is called cognitive dissonance, 
and the drive towards consistency, dis- 
sonance reduction. “When two or more 
cognitive elements are psychologically 
inconsistent, dissonance is created. Dis- 
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sonance is defined as psychological ten- 
sion having drive characteristics” so 
that when dissonance arises the individ- 
ual attempts to reduce it (Zimbardo, 
1960, p. 86). 

For our purposes at the moment the 
most important thing to note about the 
theory is that dissonance is an interven- 
ing variable whose antecedents are the 
private internal cognitions of a person. 
To test a theory like this, it is up to the 
experimenter to create various degrees 
of dissonance by introducing various 
discrepant cognitions within an indi- 
vidual. Whenever contradictory state- 
ments or syllogisms or opinions are used, 
there is not likely to be much contro- 
versy about the fact that they must lead 
to discrepant internal cognitions, and so, 
by definition, to dissonance. Indeed, 
studies on cognitive dissonance of this 
type have yielded results which are well- 
established, clear-cut, and consistent. 
But for the experiments under review 
here, the situation is rarely as simple 
as this. The Festinger group is primarily 
concerned with applying their dissonance 
formulation to predict complex social 
events. In order to do this experi- 
mentally, they use elaborate instructions 
and intricate relationships between ex- 
perimenter and S to introduce discrep- 
ant cognitions and therefore to produce 
dissonance. Under such conditions, how 
can we be sure that the experimental 
situation has been successful in creating 
dissonance and dissonance alone? 

In the face of such difficulties, it is 
always a good policy to ask the S him- 
self about the situation, either directly 
or indirectly. It should be possible, for 
instance, to find out how the Ss perceived 
each of the experimental manipulations. 
One could also determine whether Ss 
perceived the situation as conflictful and, 
if so, to what extent. This kind of in- 
formation is crucial to the theory of 
cognitive dissonance because all its pre- 


dictions are based on the assumption 
that a state of differing, incompatible 
cognitions has been produced within the 
S. Unfortunately, evidence of this kind 
from the Ss themselves is not always 
available in the studies under review 
here. As a result, it is up to the reader 
to decide whether the experimental 
manipulations had the effect which the 
authors claim. 

The other side of the coin, equally 
important, is that we must also assure 
ourselves that the experimental manipu- 
lations did not at the same time produce 
other internal states or cognitions within 
the S which could contaminate or even 
account for the findings. In fact, certain 
‘nonobvious” derivations of some of 
these experiments may perhaps become 
a little more obvious when the experi- 
ments are reinterpreted to take other 
factors into consideration. 

It is worthwhile spending a few mo- 
ments on these nonobvious derivations. 
If we disregard the intermediate steps 
and simply consider the independent and 
dependent variables, it is possible to 
describe the essential aspects of some 
of these derivations by saying that they 
follow a pain principle. Reduced to €s- 
sentials, some of Festinger’s derivations 
say that the more rewarding a situation, 
the more negative is the effect; and con- 
trariwise, the more painful a situation, 
the more positive is the effect. This }§ 
clearly illustrated by the following 
quotation from Festinger (1961): “Rats 
and people come to love the things for 
which they have suffered.” However, if 
we carefully examine the kinds of experi- 
ments which are supposed to test these 
derivations, we find that, in general, 
the situation contains both painful and 
rewarding conditions, but that the 
manipulation is interpreted in terms of 
only one of these. It should be apparent 
that if a situation is both rewarding and 
painful, and the dependent variable 
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shows a positive effect, it is not legiti- 
mate to attribute it solely to the painful 
variable, or vice versa. To use a 
statistician’s terminology, the variables 
are confounded. 

Our most general criticism, then, is 
that some dissonance experiments have 
been designed in such a way that it is 
impossible to draw any definite con- 
clusions from them. 


Examples 


The best way of illustrating these 
points is to describe an experimental 
procedure and then to analyze it from 
two points of view: Did the experimenter 
really produce the discrepant cognitions 
he said he did? Did the experimental 
manipulations produce other cognitions 
that could contaminate or account for 
his findings? 

Relief or Dissonance? Let us take this 
experiment: College women volunteered 
to participate in a series of group discus- 
sions on the psychology of sex. They 
were seen individually by the experi- 
menter before being allowed to join an 
“on-going group.” Some of the girls 
were told they would have to pass an 
embarrassment test to see if they were 
tough enough to stand the group discus- 
sion. They were free to withdraw at this 
point, and one S did so. Girls in the 
severe embarrassment group had to read 
out loud in the presence of the male ex- 
perimenter some vivid descriptions of 
sexual activity and a list of obscene sex 
words. Another group of girls—the mild 
embarrassment group—read some mild 
sexual material. All of these girls were 
told that they were successful in passing 
the embarrassment test. Each S then 

listened as a silent member to a simu- 


lated, supposedly on-going group discus- 
sion, which was actually a standard tape 
recording of a rather dull and banal 
discussion about the sexual behavior of 
animals. A control group listened only 


to the simulated group discussion. All 
groups then made ratings about this 
discussion, its participants, and their 
own interest in future discussions. The 
ratings made by the severe embarrass- 
ment group were, on the average, some- 
what more favorable than those made 
by the other two groups. 

What was this experiment about? Was 
it to demonstrate the effect of feelings 
of relief when people discover that a 
task (the group discussion) is not as 
painfully embarrassing as the embarrass- 
ment test led them to believe? No. Was 
it to demonstrate the effect of success 
in a difficult test (passing the embarrass- 
ment test) on task evaluation? No. Was 
it to demonstrate the displacement of 
vicarious sexual pleasure from a discom- 
fiting, but sexually arousing, situation 
to a more socially acceptable one? No. 
The experimenters called it “The Effect 
of Severity of Initiation on Liking for a 
Group” (Aronson & Mills, 1959); that 
is, the more painful the initiation, the 
more the Ss like the group. They predict 
the outcome for the severe embarrass- 
ment group in the following way: In 
successfully passing the embarrassment 
test these girls “held the cognition that 
they had undergone a painful experi- 
ence” in order to join a group; the dis- 
cussion, however, was so dull and un- 
interesting that they realized the un- 
pleasant initiation procedure was not 
worth it. This produced dissonance since 
“negative cognitions about the discus- 
sion . . . were dissonant with the cogni- 
tion that they had undergone a painful 
experience.” One of the ways they could 
reduce this dissonance was by re- 
evaluating the group discussion as more 
interesting than it really was. 

All this may be so, but in order to 
accept the authors’ explanation we must 
be sure the girls really did hold these 
discrepant cognitions, and no others. 
We have to be sure, for instance, that 
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they felt no relief when they found the 
group discussion banal instead of em- 
barrassing, that success in passing a 
difficult test (the embarrassment test) 
did not alter their evaluation of the task, 
that the sexual material did not evoke 
any vicarious pleasure or expectation of 
pleasure in the future, and that the 
group discussion was so dull that the 
girls would have regretted participating. 
There is no way of checking directly on 
the first three conditions, although other 
experimental evidence suggests that their 
effect is not negligible. However, to 
check on the fourth factor we have the 
data from the control group showing that 
the group discussion was, in fact, more 
interesting than not (it received an aver- 
age rating of 10 on a 0-15 scale). It 
is, therefore, difficult to believe that the 
girls regretted participating. To sum up, 
since the design of this experiment does 
not exclude the possibility that pleasur- 
able cognitions were introduced by the 
sequence of events, and since, in addi- 
tion, the existence of “painful” cogni- 
tions was not demonstrated, we cannot 
accept the authors’ interpretation with- 
out serious reservations. 

It is interesting to speculate what 
would have happened if the girls had 
been “initiated” into the group by the 
use of a more generally accepted painful 
procedure, such as using electric shock. 
Somehow it seems doubtful that this 
group would appreciate the group dis- 
cussion more than the control group, 
unless—and here is the crucial point— 
the conditions were so manipulated 
that Ss experienced a feeling of suc- 
cessful accomplishment in overcoming 
the painful obstacle. It seems to us 
that if there is anything to the rela- 
tionship between severity of initia- 
tion and liking for the group, it lies in 
this feeling of successful accomplish- 
ment. The more severe the test, the 
stronger is the pleasurable feeling of suc- 


cess in overcoming the obstacle. There 
is no need to postulate a drive due to 
dissonance if a pleasure principle can 
account for the results quite successfully. 

The same feeling of successful ac- 
complishment may, incidentally, be the 
relevant variable involved in some of the 
“effort” experiments done by the Fes- 
tinger group (e.g., Cohen, 1959). It 
seems reasonable to expect that in such 
experiments the higher the degree of 
perceived effort, the greater the feeling 
of successful accomplishment in per- 
forming a task. Thus, effort would be 
confounded with feeling of success. Note, 
however, that success is pleasant, 
whereas effort is painful. Here is a 
situation which could be both rewarding 
and painful, but dissonance workers see 
it only as painful. (Two other effort 
experiments will be analyzed in greater 
detail later in this section.) 

Reward or Incredulity? Let us look 
at another experiment, this time by 
Festinger and Carlsmith (1959): Out of 
several possibilities, Ss chose to take part 
in a 2-hour experiment falsely labeled 
as an experiment on “measures of per- 
formance.” The Ss were tested individ- 
ually and were given a “very boring” 
and repetitive task for about 1 hour. 
At the end of the hour each S was given 
a false explanation about the purpose 
of the experiment. He was told that it 
was an experiment to test the effect of 
expectation on task performance. Some 
Ss were then asked if they would mind 
acting in a deception for the next couple 
of minutes since the person regularly 
employed for this was away. The Ss of 
one group were hired for $1.00 each, 
those of another group for $20.00 each, 
to tell the next incoming S how enjoy- 
able and interesting the experiment had 
been (ostensibly the expectation vari- 
able). Each S was also told he might be 
called on to do this again. Some Ss 
refused to be hired. A control group of 
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Ss was not asked to take part in any 
deception. Subsequently, all Ss (control 
and hired Ss) were seen by a neutral 
interviewer, supposedly as part of the 
psychology department’s program of 
evaluating experiments. During the 
interview, Ss were asked to rate the ex- 
periment along four dimensions. The 
only significant difference between the 
three groups was in terms of enjoyment. 
The control group rated the experiment 
as just a little on the dull side, the $1.00 
group thought it was somewhat enjoy- 
able, and the $20.00 group was neutral. 
The mean ratings for the control and 
$20.00 groups were not significantly dif- 
ferent from each other nor from the 
neutral point. 

What was this experiment about? The 
authors call it “Cognitive Consequences 
of Forced Compliance.” They make the 
prediction that “the larger the reward 
given to the S” the smaller the disso- 
nance and therefore “the smaller the 
subsequent opinion change,” and “fur- 
thermore the observed opinion 
change should be greatest when the pres- 
sure used to elicit the overt behavior is 
just sufficient to do it.” As an aside we 
should point out that, inasmuch as these 
statements clearly refer to a maximum 
and so by inference to some sort of a 
curvilinear or nonmonotonic relation- 
ship, it would have been better if more 
reward categories had been used. In 
addition, two more control groups—a 
deception-but-no-reward group, and a 
reward-but-no-deception group—should 
have been included to separate out the 
effects of reward and deception. How- 
ever, our primary concern at the moment 
is not with such technical matters of ex- 
perimental design. ; 

Let us examine instead the meaning 

of the descriptive term “forced com- 
pliance.”’ According to Festinger 
(1957), it means “public compliance 
without private acceptance Ip. 87]. 


The reward Ss, it is true, complied 
publicly with the instructions in that 
they described a boring task as enjoyable 
to another S. Notice, however, that even 
the control group rated the task as only 
slightly boring. This suggests that the 
false explanation placed the task in a 
wider context and may have led to “pri- 
vate acceptance” of the whole situation 
by both control and reward Ss. We 
could also question the choice of the 
word “forced.” Forced implies a lack 
of freedom, but it is extremely difficult 
to predict how an S perceives his free- 
dom of choice even when this variable 
is experimentally manipulated (e.g., 
Brehm & Cohen, 1959a). All we can say 
is that the term forced compliance is not 
a good description of the events in this 
experiment. 

What seems to be even more impor- 
tant, however, is that the experiment 
could be more appropriately entitled 
“The Effect of a Plausible and Implausi- 
ble Reward on Task Evaluation.” As 
far as we can tell, Ss were not asked to 
describe their reactions to the size of 
the reward. Nevertheless, $20.00 is a 
lot of money for an undergraduate even 
when it represents a whole day’s work. 
When it is offered for something that 
must be much less than 30 minutes work, 
it is difficult to imagine a student accept- 
ing the money without becoming wary 
and alert to possible tricks. In fact, 
more than 16% of the original Ss in the 
$20.00 group had to be discarded be- 
cause they voiced suspicions, or refused 
to be hired. Under such circumstances, 
it seems likely that those who were 
retained might have hedged or been 
evasive about their evaluation of the ex- 
periment. The mean rating for the 
$20.00 group was —.05 on a scale that 
ranged from —5.00 (dislike) to 5.00 
(like), that is, the mean rating was at 
the neutral point. As other workers 
(e.g., Edwards, 1946) have suggested. 
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a rating at the zero or neutral point may 
be ambiguous, ambivalent, or indifferent 
in meaning and may simply represent 
an evasion. The authors’ data, unfortu- 
nately, do not permit us to determine 
whether individual Ss did in fact respond 
this way. In any event, if we assume 
that $1.00 is a plausible, but $20.00 an 
implausible, reward, then the results fall 
neatly into the pattern of all previous 
and more extensive experiments on the 
effect of credulity on pressures to con- 
formity (Fisher & Lubin, 1958). 

To sum up, the design of this experi- 
ment does not allow us (a) to check 
whether discrepant cognitions were in 
fact produced, and (b) to rule out alter- 
native explanations. 

Incidentally, the authors of other 
related studies (Brehm, 1960; Brehm & 
Lipsher, 1959; Cohen, Terry, & Jones, 
1959) have difficulty in accounting for 
all of their results according to disso- 
nance theory predictions. These diffi- 
culties disappear if we use a plausibility 
explanation. The argument would pro- 
ceed along these lines: If an individual 
is subjected to many pressures towards 
change from a number of different 
sources, (a) each pressure will act on 
the individual, and (b) their effect will 
be cumulative. For instance, we may 
increase pressure on an individual by 
limiting his freedom of choice, by giving 
him acceptable rewards, by presenting 
him with statements that strongly sup- 
port a position discrepant to his own, by 
increasing the size of the discrepancy, 
and so on. Each of these alone will pro- 
duce a greater and greater opinion 
change until—and this is the critical 
part of the argument—the situation 
becomes implausible, at which point the 
S will ignore the pressures and show no 
change. It also seems reasonable to sup- 
Pose that a combination of any of these 
factors will act cumulatively to produce 
the implausible effect. 


We can express this situation in 
statistical terminology. For example, 
if we have a two-factor experiment with 
two levels of pressure towards opinion 
change in each variable, we would ex- 
pect to find that the two main effects 
are significant. Moreover, we would 
predict that the interaction would also 
be significant primarily because the com- 
bination of both “high pressures” would 
be implausible and so produce the least 
opinion change. In general, this is the 
pattern of results obtained by the dis- 
sonance workers in experiments of this 
type. 

Mealtime Troubles. Another example 
of an untested interpretation occurs in 
the Brehm (1959) experiment on the 
effect of a fait accompli in which boys 
were offered a prize if they ate a portion 
of a disliked vegetable. While eating it, 
some Ss were casually told that a letter 
would be sent to their parents informing 
them of their participation in the ex- 
periment and of the vegetable they ate. 
Those boys who indicated they had 
trouble about eating the vegetable at 
home (i.e., it was more often served 
than eaten) subsequently changed their 
rating of the disliked vegetable towards 
a more favorable one. What did the 
letter mean to these boys? According 
to the author it meant that “the Ss 
would have to eat more” of the vegetable 
at home. But this is a guess, not based 
on any evidence in the experiment. 
Furthermore, in an extension of the 
same procedure at the same school with 
equivalent Ss from the same classes, 
direct manipulation of the commitment 
to further eating “failed to produce an 
overall effect on liking [Brehm, 1960, 
p. 382].” Under the circumstances, we 
find it difficult to accept the author’s 
contention that the fait accompli in- 
creased cognitive dissonance by increas- 
ing the commitment to eating. There 
is little doubt that mentioning the letter 
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changed the ratings, but only for boys 
who had mealtime troubles. The key 
to the problem most likely lies in the ex- 
pectation these boys had about the effect 
of the letter on their parents and on 
themselves. However, the design of the 
experiment does not allow us to find out 
what this expectation was. 

Confounded Effort. In a recent ex- 

periment, Aronson (1961) tried to 
separate the effects of secondary rein- 
forcement from dissonance in a reward- 
ing situation. 
Reinforcement theory suggests that stimuli 
associated with reward gain in attractiveness; 
dissonance theory suggests that stimuli asso- 
ciated with “no reward” gain in attractiveness 
. .. if a person has expended effort in an at- 
tempt to attain the reward [p. 375]. 


Aronson argues that since the effect of 
secondary reinforcement is constant, 
nonrewarded objects should become 
more attractive as the effort to obtain 
them increases. 

In order to test this hypothesis, Ss 
fished for cans to obtain a reward ($.25) 
inside one third of the cans. The re- 
warded cans were of one color, the non- 
rewarded ones of another, but the Ss 
could not determine which they had 
snared until the cans had actually been 
pulled out. One group of Ss—the low- 
effort group—was told that their task 
was not tedious. They had the relatively 
easy task of fishing out a can with a 
magnet, a task which took them, on the 
average, only 14 seconds per can, 
Another group of Ss—the high-effort 
group—was told that their task was ex- 
tremely tedious. They had the relatively 
difficult task of fishing out a can with 

a hook, which took them, on the average, 
52 seconds per can. All Ss continued 
fishing for the reward money until 16 
unrewarded cans had been pulled out. 
The Ss rated the relative attractiveness 
of the two colors before and after carry- 
ing out the task. The results show that 


in the low-effort condition, the attrac- 
tiveness of the color on the rewarded 
cans increased (a secondary reinforce- 
ment effect), but in the high-effort con- 
dition no change was observed. All of 
this was interpreted as substantiating 
the cognitive dissonance predictions. 

Aronson explains the lack of change in 
the high-effort condition by saying that 
the effects of dissonance and secondary 
reinforcement are equal but opposite in 
direction, and so cancel each other. 
However, if we look more carefully at 
the experimental manipulation of ef- 
fort, we see that the low-effort condi- 
tion is actually a reward rate of $.25 
about every 42 seconds, and the high- 
effort condition is actually a reward rate 
of $.25 about every 156 seconds. In 
other words, the low-effort group is, at 
the same time, a high-reward-rate group, 
and the high-effort group, a low-reward- 
rate group. The difference obtained be- 
tween the two groups could then be 
simply the result of the difference in 
reward rates, and the lack of change in 
the high-effort group, the result of their 
low reward rate. 

To summarize, Aronson tried to dem- 
onstrate the effect of effort in a reward- 
ing situation. However, the design of 
the experiment confounds effort with 
reward rate. As a result, no unambigu- 
ous conclusions can be drawn as to the 
effect of effort. 

There is yet another experiment on 
effort in which confounding occurs. 
Yaryan and Festinger (1961) tried to 
show the effect of “preparatory effort’ 
on belief in a future event. The Ss volun- 
teered to participate in an experiment 
labeled “Techniques of Study” which 
was supposed to investigate the tech- 
niques, hunches, and hypotheses that 
students use to study for exams. The Ss 
were told that only half of them would 
take part in the complete experiment 
which involved taking an IQ test. All Ss 
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were given an information sheet on 
which there were definitions essential to 
this supposed IQ test. In the high-effort 
condition, Ss were told to study the sheet 
and memorize the definitions. In the 
low-effort condition, Ss were told to 
glance over the definitions briefly. The 
latter were also told that they would 
have access to the sheet later if they 
were to take the IQ test. Each S was 
then asked to express his estimate of the 
probability that he would take the IQ 
test. The results show that Ss in the 
high-effort group thought it was more 
probable that they would take the test. 

The authors (Yaryan & Festinger, 
1961) explain the results in the follow- 
ing way: Exerting “a great deal of 
effort” is inconsistent with the cognition 
that one may not take the test, so Ss 
in the high-effort group should believe 
“more strongly in the likelihood of the 
occurrence of the event [p. 606].” This 
might very well be the case, but in this 
experiment the variable of effort is con- 
founded with the presence of other pre- 
dictors for the event. All Ss had been 
told that this was an experiment on the 
techniques of study, but the only group 
which did any studying was the high- 
effort group. In addition, the studying 
that was done was highly relevant for the 
IQ test. Under the circumstances, it 
does not seem at all surprising that Ss 
in this group took these additional cues 
to mean that they were assigned to the 
complete experiment and to the IQ test. 
As it stands now, the Yaryan and Fes- 
tinger experiment does not separate the 
effect of effort from that of additional 
cues, 

Reliability Is Not Validity. As we 
have seen, most cognitive dissonance 
formulations are concerned with what 
happens after a person makes a decision. 
One of the earliest experiments de- 
Signed specifically to investigate this 
problem was the gambling experiment 


described by Festinger (1957, p. 164) 
and successfully replicated by Cohen, 
Brehm, and Latané (1959) with minor 
variations in procedure. The agreement 
between these two studies has done 
much to enhance the belief in the va- 
lidity of cognitive dissonance formula- 
tions (e.g., Riecken, 1960, p. 489). 
The experimental procedure in these 
two studies was relatively simple. Each 
S played a card game with the experi- 
menter for variable money stakes. Be- 
fore beginning the game each S was 
informed of the rules of the game, and 
on the basis of this information chose 
one of two sides on which to play. He 
was told that he could change sides once 
during the game, but that it would cost 
him money. He was led to expect that 
he would play 30 games. At the end of 
12 games, play was interrupted and S 
was given a probability graph to study. 
The graph, a different one for each side, 
gave the (false) information that the 
chosen side was the losing one. The de- 
pendent variables were the time spent 
in studying the graph and the number 
of people who changed sides. Results 
were analyzed in terms of a weighted 
average of the amount of money won or 
lost. The pattern of results obtained is 
very complex and would require at least 
a fourth-order parabola to describe it. 
Nevertheless, the various ups-and-downs 
were interpreted as supporting the 
dissonance theory predictions for postde- 
cision, information seeking processes. 
Two things strike us about the dis- 
sonance theory interpretation of this 
experiment. First, Festinger is not con- 
sistent in his dissonance formulations. 
Let us look at the way in which the re- 
sults are interpreted. The money win- 
ners spent a moderate amount of time 
studying the graph. Festinger considers 
this the result of dissonance produced 
by the information in the graph which 
purported to show that these winners 
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were actually on the losing side. If we 
accept this line of reasoning, it should 
follow that the losers would have no 
such dissonance (the graph confirmed 
their losses), and would therefore spend 
little time on the graph. This was not 
so. Festinger (1957), however, has three 
other dissonance explanations to account 
for the complex behavior of the losers. 
He argues, first of all, that loss of money 
is itself dissonance producing and the 
bigger the loss, the bigger the disso- 
nance. The small losers spent as much 
time as they did in the “hope that the 
graph would tell them they were actually 
on the correct side [p. 171].” If this 
explanation is correct, the bigger losers 
should have spent an even longer time 
searching the graph—but they did not. 
Festinger explains this away by saying 
that the bigger losers would avoid the 
graph “if the graph were perceived as 
yielding information which would proba- 
bly increase the dissonance which al- 
ready existed [p. 172].” If this ex- 
planation is correct, the biggest losers 
should have spent the least amount of 
time on the graph—but they did not. 
To explain this behavior Festinger pos- 
tulates yet another hypothesis. For the 
biggest losers “the easiest way to elimi- 
nate the dissonance would be to increase 
it temporarily to a point when it was 
greater than the resistance to change of 
the behavior,” that is, they would study 
the graph, then switch sides. If this 
explanation is true, then we would ex- 
pect that all of the biggest losers, and 
only the biggest losers, would switch 
sides—but this was not so. It should be 
noted that these four dissonance hy- 
potheses are inconsistent with each 
other, since they predict effects in dif- 
ferent directions. Moreover, there is no 
a priori way of determining the degree 


to which each particular hypothesis ap- 
plies to the groups. This whole matter 


can be summarized in another way: If 


the pattern of results had been exactly 
the reverse, these same explanations 
would apply just as well. 

This brings us to the second point. 
The most important criticism of this 
gambling experiment is that it is not so 
much an experiment on the dissonance- 
reducing effects of information in post- 
decision processes, as it is an experiment 
on “information seeking in predecision 
processes” (suggested by F. E. Emery). 
The Ss had been told that they could 
change sides and they were actually 
given an opportunity to do so when they 
were handed the graph. Festinger 
(1957) and Cohen et al. (1959) re- 
ported that many Ss, both winners and 
losers, announced their decision to 
switch sides at this time. What was not 
reported, however, was the number of 
Ss who looked at the graph in order to 
reach a decision whether or not it would 
be more profitable to change sides. In 
other words, Ss looked at the graph not 
to reduce dissonance, but to look for 
information to help them decide whether 
they should change sides. With this in- 
terpretation, the pattern of results be- 
comes more obvious and reasonable, For 
example, in one of the two conditions 
of the Cohen et al. replication those Ss 
who neither won nor lost any money 
spent the most time looking at the 
graph. This finding is entirely inex- 
plicable in dissonance terms, even with 
an imaginative use of all four of Fes- 
tinger’s hypotheses. However, if we con- 
sider this experiment to be concerned 
with predecision processes, then we see 
that these Ss had gained the least 
amount of information from the actual 
play of the game and were, therefore, 
trying to extract as much information 
as possible from the graph before reach- 
ing a decision. 

Taking all of these factors into con- 
sideration, we are forced to conclude that 
Festinger’s interpretations, however in- 
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genious, are unnecessarily elaborate 
and unjustified. Moreover, the success- 
ful replication of the experiment sug- 
gests—not that the cognitive dissonance 
formulations are valid—but only that 
the results of experiments of this type 
are reproducible. 

Subsequent experiments on selected 
aspects of information seeking in post- 
decision processes have done nothing to 
clarify the situation. Adams (1961) and 
Maccoby, Maccoby, Romney, and Adams 
(1961) showed that people tend to seek 
information which agrees with their 
viewpoint, but Rosen (1961) obtained 
results which show that people tend to 
seek information which disagrees with 
their viewpoint. It seems more likely 
to us that, in general, people will often 
seek new information, whether it be con- 
sonant or contrary. Indeed this is ap- 
parently the kind of result that Feather 
(1962) obtained. To be sure, when 
prejudice or some other highly motivated 
state is involved, people are selective 
in their perceptions and avoid contrary 
material. But under these conditions 
it is the motive itself, and not disso- 
nance, that seems to be crucial. 


Interpretation of Manipulations 


Perhaps the illustrations cited above 
will suffice to show that the experiments 
adduced to support the theory of cogni- 
tive dissonance involve highly complex 


Cognition 
+ 
Cognition 


EXPERIMENTAL a 
MANIPULATIONS 


ee 


11 


manipulations. The effects of these ma- 
nipulations are open to alternative expla- 
nations which have generally not been 
dealt with adequately by the authors. 
We can diagram this in the following 
way: Let us suppose that the complex 
experimental manipulations produce the 
Cognitions 1, 2, 3,4 . . . z in the S, as 
illustrated on the left side of Figure 1. 
Two of these cognitions (Cognitions 1 
and 2) are chosen by the experimenter 
as being the relevant discrepant cogni- 
tions producing dissonance. Any ob- 
served change in the dependent variable 
is then attributed to that dissonance. 
But, as we see by examining the whole 
of Figure 1, this may not necessarily be 
the case. Any one cognition, or any 
combination of cognitions, could have 
been responsible for the change in the 
dependent variable. There is no way of 
ascertaining which, because the effects 
of all these cognitions have been con- 
founded. 

It is possible to design experiments so 
that these effects are not confounded. 
As a step in this direction we recom- 
mend first of all that the experimental 
manipulations be simplified. It is diffi- 
cult to agree about differences in cogni- 
tions when the instructions, task, and 
procedure differ in many ways for con- 
trol and experimental groups. Our sec- 
ond recommendation is that additional 
control groups be included in the design 
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The type of confounding frequently 


found in experiments on cognitive dissonance. 
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of these experiments to deal with the 
irreducible differences in experimental 
manipulation. Our third recommenda- 
tion is that a little more attention be 
given to discovering the possible cogni- 
tions that an S might have about the 
situation, particularly those which might 
be contrary to dissonance theory. Only 
under such carefully controlled condi- 
tions can we begin to talk about un- 
equivocal evidence in support of cogni- 
tive dissonance theory. 


CONTROVERSIAL TREATMENTS OF 
THE DATA 


So much for experimental manipula- 
tions. Now to see if the experimenter 
really got the results he says he did. 
Our most serious criticisms of the ex- 
periments cited in support of dissonance 
theory fall under the heading of meth- 
odological inadequacies in the analyses 
of results. Of these inadequacies the 
most important is the rejection of cases, 
not only because it is so fundamental a 
flaw, but also because the supporters of 
dissonance theory so often do it. 


Rejection of Cases 


If an experimenter is interested in the 
performance of only a certain group of 
Ss, it is legitimate for him to select these 
Ss before beginning the experiment, or 
sometimes even after the experiment, 
before the results are analyzed. How- 
ever, when Ss are selectively discarded 
after the data of an experiment have 
been collected, tabulated, and sometimes 
even analyzed, it leaves the reader with 
a feeling of uneasiness, The uneasy feel- 
ing grows if the Ss are discarded because 
their results are said to be “unreliable,” 
or if the experimenter gives inconsistent 
reasons—or no reasons at all—for their 
rejection. But let us look at the experi- 
ments themselves. 
Unreliable Ss. Brehm and Cohen 


(1959b) asked sixth-grade children to 


indicate how much they liked several 
different toys before and after they had 
chosen one for themselves. The choice 
of the gift and the postchoice rating 
were made a week after the prechoice 
rating. The authors hypothesized that 
there would be an increase in the evalua- 
tion of the chosen article, and a decrease 
in the evaluation of the nonchosen arti- 
cle, the greater the dissimilarity among 
the toys, and the greater the number of 
alternatives from which to choose. In 
general these predictions were upheld. 
But of the original sample of 203 chil- 
dren only 72 were used in the analysis. 
In the authors’ (Brehm & Cohen, 
1959b) words, the reasons for the re- 
duction were as follows: 

First, the choice alternatives for each S had 
to be liked, but not so much that an increase 
in liking would be impossible. Second, one 
alternative had to be initially more liked than 
any other so that its choice could be expected. 
Increased ratings of the chosen item are thus 
not likely to be simply a result of normal (and 
random) changes in actual liking from the 
first questionnaire to the second. In addition, 
Ss who failed to choose the alternative initially 
marked as most liked, were excluded because 
they gave unreliable or invalid ratings. Finally, 
in order to ensure that initially less liked 
alternatives were seriously considered as pos- 
sible, initial ratings of these alternatives could 
not be much lower than the most liked alter- 
native [p. 375; italics added]. 

Note first that the exact limits of all 
these requirements were determined only 
after inspection of the data, despite the 
fact that each S had been given a pre- 
arranged choice based on his initial rat- 
ings. However, let us look more care- 
fully at the italicized item—the exclu- 
sion of Ss because of unreliability. If 
Ss give unreliable results, it is usual to 
assume that the measuring instrument 
itself is unreliable; indeed, the authors 
themselves admit this when they men- 
tion “the low reliability of our measure 
of liking.” However, discarding se- 
lected Ss does nothing to improve the 
reliability of an instrument. 
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Discarding Ss who did not choose the 
alternative initially marked as most 
liked may in fact falsely reduce the 
computed error variance, change the 
mean values, and so enhance the possi- 
bility of obtaining a significant differ- 
ence in rating. To illustrate, the upper 
half of Figure 2 shows the ratings for 
two toys, X and Y, which satisfy the 
conditions specified by Brehm and 
Cohen: they are both liked, one is 
liked more than the other, but the differ- 
ence is not great. Now let us assume 
that the ratings are subject to errors of 
measurement and that they vary ran- 
domly from time to time. Let us further 
assume that the expressed rating, the 
liking, at any one moment in time, is 
perfectly correlated with choice. 

The situation a week later is shown in 
the bottom half of Figure 2. The ratings 
for X are now distributed as X’, the 
ratings for Y, as Y’. Now the Ss are 
asked to make a choice. Let us assume 
the null hypothesis, that is, the actual 
process of choosing a gift does not alter 
the liking or rating of a toy. Since the 
choice and the postchoice rating occur 
so close together in time we can also 
assume that no random change occurs 
from just before the choice to just after 


it. If, as Brehm and Cohen did, we 
discard all those Ss who chose Y rather 
than X, this means that we eliminate 
from the shaded area in the bottom half 
of Figure 2 all those cases in which 
Y’>X’. Such a process can only reduce 
the variance of both distributions of dif- 
ferences in ratings (that is, X’—X and 
Y’—Y), and automatically increase the 
mean difference between them. More- 
over, this selection procedure will auto- 
matically produce precisely the effect 
which the authors predicted, namely, 
that the mean rating for the chosen toy 
increases, while the rating for the non- 
chosen toy decreases. Furthermore, these 
effects will be greatest in the group from 
which most such Ss were discarded. Most 
of these discards came from the “four al- 
ternatives condition,” è and the change 
in rating is actually greatest for this 
condition. 

In a footnote (p. 376) the authors 
(Brehm & Cohen, 1959b) state they 
carried out similar computations on their 
“unselected population,” that is, on the 
entire sample of 203 Ss. Although the 
sensitivity of the statistical test is now 
nearly twice as great (because of the 
increase in N from 72 to 203), they find 


2J. Brehm, personal communication, 1961. 
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no effect due to the number of alterna- 
tives (one of the two predictions made). 
Tests of the other prediction “yield sup- 
port” for the “dissimilarity hypothesis.” 
It is not clear, however, whether the 
authors mean by this a statistically sig- 
nificant difference, or simply a nonsig- 
nificant trend. To sum up, it seems 
reasonable to conclude that the signifi- 
cant results obtained in this and similar 
experiments may very well be statistical 
artifacts. 

Contradictory Reasons. Sometimes it 
is difficult to reconcile the reasons given 
for the rejection of cases with other 
statements by the same author. For ex- 
ample, in the experiment on “Attitude 
Change and Justification for Compli- 
ance” (Cohen, Brehm, & Fleming, 
1958), the initial analysis showed no sig- 
nificant difference between the two justi- 
fication groups. The authors then elimi- 
nated more than half of the Ss (47 out 
of 92), carried out a second analysis on 
the remainder, and concluded that “the 
difference in amount of change is sig- 
nificant by one-tailed ¢ test at .07 level.” 
Relatively more Ss whose opinion did 
not change were eliminated from the 
low-justification condition (35 out of 
63). Not surprisingly, the new mean 
for the low-justification condition turned 
out to be greater than for the high-justi- 
fication condition. 

Part of the reasoning for this selec- 
tion of cases was as follows (Cohen et 
al., 1958): “since extremity of position 
inhibits attitude change . . . it seems 
reasonable to eliminate the extremes [p, 
277].” A year later, however, Cohen 
(1959) made this statement: “If the 
individual . . . engages in some be- 
havior with regard to the contrary com- 
munication . . . then the greater the 

discrepancy [extremity of position], the 
greater the opinion change [p. 387].’ 
In their original article, Cohen et al. 


state that their results should be in- 


terpreted cautiously, but, unfortunately, 
they do not follow their own advice. 
Whenever these authors refer to their 
findings in later articles (e.g., Brehm, 
1960; Cohen, 1960), they quote their 
results as substantiating cognitive dis- 
sonance theory without any of these 
cautionary reminders, 

What Is Going On? An example of 
sample reduction for obscure reasons 
occurs in an experiment on the reader- 
ship of “own car” and “other car” ad- 
vertisements by new and old car owners 
(Ehrlich, Guttman, Schonbach, & Mills, 
1957). A group of 65 new car owners 
was randomly chosen from a list of re- 
cent auto registrants. The car adver- 
tisements read by this group were com- 
pared with those read by a group of 60 
old car owners chosen from a telephone 
directory. The raw data for these analy- 
ses were the percentages of car adver- 
tisements noticed and read in a selection 
of magazines and newspapers which the 
owners had previously indicated they 
read regularly. The cognitive dissonance 
theory predictions were that new car 
owners would most often read advertise- 
ments about their own make of car and 
avoid reading those of competing makes. 
In general, these predictions were up- 
held for the data presented. 

The principal difficulty with this ex- 
periment is that cases were successively 
rejected in various stages of the analysis 
so that when one finally arrives at the 
critical statistical test it is virtually im- 
possible to determine what the remain- 
ing data mean. Let us see if we can 
trace the authors’ steps in this process. 
The authors first present a table showing 
the mean percentage of advertisements 
noticed and mean percentage of adver- 
tisements read of those noticed for each 
of the categories “own,” “considered,” 
and “other car.” They (Ehrlich et al., 
1957) state in a note accompanying the 
table that: 
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The N’s are reduced because in some cases no 
advertisements of a particular kind appeared 
in the issues shown or none of those which 
appeared were noticed. They are further re- 
duced because not all respondents named cars 
as “seriously considered” [p. 99]. 


The first and third reasons impose a 
limit on the number of Ss whose results 
could be used. The largest reduction 
due to these two limitations was in the 
category “considered car” for old car 
owners, where the V of 60 was reduced 
to 31. 

The second reason given in the quota- 
tion above means that an owner who did 
not notice any advertisement in a par- 
ticular category was discarded from the 
table of “advertisements read” for that 
category. For example, if an owner no- 
ticed (or noticed and read) an ad about 
another car but did not notice any ad- 
vertisements about his own car, he was 
included under the category of “other 
car,” but excluded from the category of 
“own car” in computing the mean per- 
centage of “advertisements read of those 
noticed.” Up to one third of the remain- 
ing cases were eliminated from the vari- 
ous categories for this reason. 

The next point at which still more 
cases are rejected is in the computation 
of several sign tests of significance. We 
are told that the Ns are reduced because 
not all comparisons were possible. What 
this means is that significance tests were 
computed only on those owners who 
noticed at least one advertisement in 
each of the pairs of categories compared. 
Finally, those owners who read an equal 
percentage of advertisements in each of 
the two categories were also discarded. 

Taking all of the above factors into 
account we find that as much as 82% of 
the original sample was discarded in 
certain categories! 

The sign tests mentioned above were 
used only for making certain pairs of 
comparisons. For overall tests of their 
hypotheses the authors resorted to chi 


square and give terminal chi square 
values, with their associated probabili- 
ties, alongside the tables for the sign 
tests. The article itself does not say 
upon what Ws, or what groupings, the 
chi squares were computed, but corre- 
spondence * reveals that the chi square 
tests were made on the same Ss as were 
used in computing the sign tests. 

At best this entire situation may be 
described as unclear. In the first place, 
it is difficult to know how to interpret 
significance tests based on such highly 
selected data. Furthermore, in comput- 
ing chi squares for the same Ss as were 
used in the sign tests, it appears that 
the authors discarded some data (the 
ties) which should properly have been 
included. If we have been able to thread 
our way correctly through the authors’ 
manipulations of the data we find that 
the chi squares, computed for all the 
relevant data, are less significant than 
reported by the authors, and, in two of 
three cases, change a nominally signifi- 
cant value to a nonsignificant one. In 
any event, there can be no doubt that 
the authors’ (Ehrlich et al., 1957) sum- 
mary statement “It was found that new 
car owners read advertisements of their 
own car more often than .. .” needs 
considerable qualification. With so much 
selection of Ss and with such intricate 
manipulations of the data, some of it 
never fully explained, one can hardly 
describe the results as public, or the 
findings as necessarily significant. 

Manipulation Not Successful. Still an- 
other type of rejection we find in these 
studies is the elimination of entire 
groups of Ss. If one variant of the 
manipulation fails to show an effect, it 
is not legitimate to discard all the Ss 
in that group from the analysis. The 
analysis should properly be carried out 
on all the data and the interpretations 
should be based on the complete analy- 


J. Mills, personal communication, 1962. 
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sis. Brehm (1960), for example, used 
reports on the vitamin and mineral con- 
tent of vegetables to try to influence the 
attitudes of Ss after they had eaten a 
disliked vegetable. One group of Ss re- 
ceived the vitamin report, the other 
group the mineral report. Since the 
mineral report “failed to affect” the de- 
pendent variable “the results for these 
subjects [were] omitted from this report 
[Brehm, 1960, footnote, p. 380].” One 
consequence of rejecting an entire group 
is that we do not know if there is a sig- 
nificant interaction between type of re- 
port and the other variables. Until this 
is established, it is misleading to con- 
sider a segment of the results as signifi- 
cant. In addition, the author nowhere 
states that his findings are specific to 
one type of report only. His summary is 
in terms of “communications about food 
value.” 
Reallocation Instead of Rejection. An 
interesting variant of the rejection of 
Ss occurs in the Raven and Fishbein 
(1961) study on the effect of “Accept- 
ance of Punishment and Change in Be- 
lief.” Groups of Ss were run under two 
conditions, “shock” and “no-shock.” 
There were 13 females and 13 males in 
each of these two conditions. The re- 
sults show that there was no overall 
difference between the shock and no- 
shock groups. However, when the re- 
sults were tabulated separately for the 
two sexes, it appeared that the female Ss 
in the shock group changed in the pre- 
dicted direction, but that the male Ss 
did not. Here is how the authors (Raven 
& Fishbein, 1961) dealt with the situa- 
tion: 
Overall analysis of variance and interaction 
was not significant. Assuming that male shock 
Ss were part of a common population with 
the non-shock subjects, with respect to dis- 
sonance, an analysis of variance was conducted 
which showed the female shock subjects to be 
significantly different from the others [p. 415]. 


In other words the authors disposed of 


the Ss who did not conform to their 
prediction, not by rejecting them, but 
by reallocating them to another group, 
the no-shock group. If females in the 
shock group really are significantly dif- 
ferent from all the others, this should 
show up in a significant interaction. It 
does not. 

Rejection of cases is poor procedure, 
but reallocation of Ss from experimental 
to control group, across the independent 
variable, violates the whole concept of 
controlled experimentation. 

Danger of Rejecting Ss. A theme 
common to many of these rejections is 
that the unselected sample “does not 
permit of an adequate test” of the dis- 
sonance hypothesis. We are told: 

In a social influence situation there are a 
number of potential channels of dissonance 
reduction, such as changing one’s own opinion, 
changing the opinion of the communicator, 
making the communicator noncomparable to 
oneself, seeking further support for one’s posi- 
tion, dissociating the source from the content 
of the communication, and distorting the 
meaning of the communication [dissonance 


theory position ably summarized by Zimbardo, 
1960, p. 86]. 


Such a theoretical formulation is indeed 
all-encompassing and it provides a ra- 
tionale which certain other dissonance 
theory workers have used for rejecting 
cases, The reasoning goes like this: If 
some Ss do not follow the specific pre- 
dictions in a particular experiment (for 
instance, if they fail to show any opin- 
ion change) then those Ss are probably 
reducing their dissonance through some 
other channel or else they had little dis- 
sonance to begin with. If either of these 
conditions holds it is legitimate to ex- 
clude these Ss from the analysis since 
they could not possibly be used to test 
the particular hypothesis in the experi- 
ment. An inspection of results is con- 
sidered sufficient to determine whether 
Ss are, or are not, to be excluded. Un- 
fortunately, this line of reasoning con- 
tains one fundamental flaw: it does not 


COGNITIVE DISSONANCE: FIVE YEARS LATER 17 


allow the possibility that the null hy- 
pothesis may be correct. The experi- 
menter, in effect, is asserting that his 
dissonance prediction is correct and that 
Ss who do not conform to the prediction 
should be excluded from the analysis. 
This is a foolproof method of guaran- 
teeing positive results. 

Some people may feel that no matter 
how questionable the selection proce- 
dure, it must still mean something if it 
leads to significant results. This point of 
view, however, cannot be reconciled with 
the following fact of life: it is always 
possible to obtain a significant difference 
between two columns of figures in a 
table of random numbers provided we 
use the appropriate scheme for rejecting 
certain of those numbers. For all we 
know, selecting Ss so as “to permit an 
adequate test of the hypothesis” may 
have had precisely this effect. A signifi- 
cance test on selected Ss may therefore 
be completely worthless. 


We strongly recommend that Ss not 
be discarded from the sample after data 
collection and inspection of the results. 
Nor is it methodologically sound to re- 
ject Ss whose results do not conform to 
the prediction on the grounds that they 
have no dissonance, or that they must be 
reducing it some other way. If there are 
any theoretical grounds for suspecting 
that some Ss will not show the predicted 
dissonance-reduction effect, the char- 
acteristics of such Ss, or the conditions, 
should be specifiable in advance. It 
should then be possible to do an analysis 
on all Ss by dividing them into two 
groups, those predicted to show disso- 
nance reduction, and those predicted 
not to show it. If such a thing as dis- 
sonance reduction exists, it is theoreti- 
cally and practically important to know 
the precise conditions under which it 
does and does not occur. 

A summary of experiments in which 
Ss are rejected is given in Table 1. 


TABLE 1 


List OF EXPERIMENTS 


rrom Waca Ss WERE DISCARDED AFTER Data COLLECTION 


Reasons given 


Discarded 
neni aN 
Experiment Total (Jo) 
Brehm (1956) 225 35 
Brehm (1960) 85° 38° 
Brehm & Cohen (1959b) 203 65 


To permit adequate test of hypothesis 
1. Unreliable Ss 
2. Conditions not fulfilled 

One manipulated condition not significant 

To permit adequate test of hypothesis 
1. Ceiling effect for high scorers 
2, Adequate separation of choice points 

for dissonance to occur 

3. Unreliable Ss 

None 

To permit adequate test of hypothesis 
1, Extremity of attitude inhibits attitude 

change 

Material missing 

Advertisements not noticed 

Not all comparisons possible 

Ties 

To permit adequate test of hypothesis 


1. Ceiling effect for high scorers 
2. Honest improvers have no dissonance 


ponp 


Brehm & Lipsher (1959) 114 10-14 

Cohen, Brehm, & Flem- 92 51 
ing (1958) 

Ehrlich et al. (1957) 125 17-82 

Mills (1958) 643 30 
` Estimated. 7 ap 
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Refusals 


The previous section has been con- 
cerned with sampling bias due to the 
deliberate rejection of cases by the ex- 
perimenter. There is another type of 
sampling bias, equally important but 
much more subtle, that occurs when Ss 
reject themselves from the study by re- 
fusing to participate. 

In a recent review of cognitive disso- 
nance experiments, Cohen (1960) con- 
cluded with what he considered was a 
“depressing” and “Orwellian?” state- 
ment: 

It could be said that when the individual feels 
that he has most freedom of choice, when his 
volition and responsibility are most engaged, 
he is then most vulnerable to the effects of 
persuasive communications and to all sorts of 


controlled inducements from the world at 
large [p. 318]. 


This statement follows hard on the heels 
of “the more negative the person is 
toward a communication or communica- 
tor, the more he can be expected to 
change his attitudes in the direction of 
the communication or communicator.” 
These are indeed sweeping generaliza- 
tions, particularly since they are based 
on the results of experiments in which 
from 4% (Cohen, Terry, & Jones, 1959) 
to as many as 46% (Rabbie, Brehm, 
& Cohen, 1959) of the total number of 
Ss refused to participate. Moreover, 
there is evidence in these studies that 
the Ss who refused to participate were 
actually those who had both the great- 
est freedom of choice and the strongest 
(most negative) views on the attitude in 
question. What actually appears to have 
happened is that those Ss with the 
strongest (most negative) views were so 
invulnerable to the effects of persuasive 
communications that they exercised 
their freedom of choice by walking out 
on the experimenter or refusing to com- 
ply in other ways. To take the results 


of the remaining more vulnerable Ss and 
extrapolate from them to the population 
in general seems unjustified. 


Inadequate Design and Analysis 


It is rare to find in this area a study 
that has been adequately designed and 
analyzed. In fact, it is almost as though 
dissonance theorists have a bias against 
neat, factorial designs with adequate Ns, 
capable of thorough analysis either para- 
metrically or nonparametrically. The 
majority of their experiments are some 
variant of the 2x2 factorial with un- 
equal, nonproportional, and generally 
small Ns in each cell. These restric- 
tions make it impossible for the authors 
to carry out ordinary analyses of vari- 
ance. Instead we find them making use 
of a hodgepodge of ¢ tests and a statistic 
which they refer to as an “interaction ¢” 
(Walker & Lev, 1953, pp. 159-160). 

Making a number of ordinary ¢ tests 
on the same set of data, without a prior 
overall test of the null hypothesis, can 
be misleading. The principal difficulty 
is that in making such multiple com- 
parisons the experimenter is allowing 
himself a number of opportunities to 
find an event (significance) which nor- 
mally occurs infrequently. As a result, 
the usual ¢ tables underestimate the true 
probabilities, that is, the probabilities 
obtained suggest a level of significance 
which is higher than warranted. An- 
other way of saying it is that if, out of 
several subgroups, one finds one or two 
t’s significant, he is, in effect, capitalizing 
on chance (e.g., Sakoda, Cohen, & Beall, 
1954). A further complication arises if 
the interaction is significant, since this 
introduces the usual difficulties about 
interpreting the main effects (e.g., Lind- 
quist, 1953, p. 209). Some of the spe- 
cial statistical problems involved in the 
“postmortem” testing of comparisons 
were, of course, being discussed in the 
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psychological literature well before dis- 
sonance theory appeared on the scene 
(e.g.. McHugh & Ellis, 1955); but for 
an excellent discussion of the basic issues 
involved in making multiple compari- 
sons, see the article by Ryan (1959). 
None of these problems is ever faced 
squarely by the writers in this field. As a 
result, the authors sometimes reach con- 
clusions that are not really warranted. 
Examples. We can illustrate these 
remarks by referring to an analysis 
carried out by Brehm (1960) on two 
treatment variables, commitment and 
communication. There are three levels 
of commitment—control, low-eating, 
and high-eating—and, in addition, two 
types of communication—support and 
no support. Since the Ws for these six 
groups are different (they vary from 7 
to 11), it is not possible to carry out an 
ordinary analysis of variance. With such 
data at least 15 ¢ tests and 3 interaction 
ts are possible. Brehm gives the results 
of 7 such ¢ tests (4 are nominally sig- 
nificant) and 2 such interaction ?#’s 
(both nominally significant). How do 
we interpret the results? Frankly, it is 
impossible. Taken at its face value, the 
analysis is not only useless, it is mis- 
leading. a 
An allied set of criticisms can be 
leveled at the analysis carried out by 
Brehm and Cohen in their study of the 
effects of choice and chance in cognitive 
dissonance (1959a). The design in- 
volved two types of relative deprivation, 
high and low. Five sections of an intro- 
ductory psychology course were used as 
Ss. The low- and high-deprivation con- 
ditions were experimentally manipulated 
and perceived as such by the Ss. The 
low- and high-choice conditions were, 
however, determined separately for each 
section on the basis of their medians on 
the perceived-choice rating scale. Sepa- 
rate interaction #’s were calculated for 
each of the five sections. The Ns in each 


cell were very small, ranging between 3 
and 10 with an average of about 7. The 
probability values for these 5 interaction 
t’s showed that one was significant, two 
tended to significance, and two were 
nonsignificant (one was actually a re- 
versal). Here again the authors’ failure 
to compute and report the results of an 
overall test make it exceedingly difficult 
for readers to interpret their findings. 
Moreover, there seems to be little justifi- 
cation for using a different value for the 
cutoff point between high- and low- 
choice for each section. In fact, such a 
procedure might in itself lead to statisti- 
cally significant median differences be- 
tween the sections. There may indeed 
be a significant interaction between 
choice and deprivation, but the evidence 
for it is, at best, questionable. 

“In dissonance experiments there is 
often a marked change between the pre- 
and posttest measures for both control 
and experimental groups. This is in it- 
self an interesting phenomenon and 
should be thoroughly evaluated. An 
analysis should be complete—large main 
effects should not be ignored just be- 
cause dissonance theory predicts only an 
interaction, or vice versa. 

It is not impossible to apply a rigor- 
ous methodology to this area. Disso- 
nance theorists would have done well to 
emulate the example set by Kelman as 
far back as 1953 (a study which, inci- 
dentally, anticipates and predates most 
of the areas of interest for cognitive 
dissonance workers). All of the problems 
that beset research in this area, such as 
unequal Ns, class differences, and so on, 
were handled expertly by Kelman. More 
recently, such eclectic workers as Rosen- 
baum and Franc (1960) and McGuire 
(1960) have also been working in this 
area and have been using rigorous and 
comprehensive methods of analysis. In 
short, there appears to be no reason why 
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methodology in this area cannot be 
sharpened. 

A summary of experiments in which 
the analyses and statistical interpreta- 
tions are doubtful is given in Table 2. 


Straining for Significance 


The final feature of the analyses that 
is apt to be misleading is the fact that 
authors tend to present results as sig- 
nificant and as supporting the disso- 
nance theory prediction when the proba- 
bilities are greater than the usually 
accepted value of .05. Probability values 
between .06 and .15 (once even .50!) 
do not constitute striking support for 
any theory, particularly if it is preceded 
by a selection of Ss and poor analysis. 
It is also extremely disconcerting to find 
these statistically nonsignificant trends 
quoted authoritatively in subsequent re- 
ports and later reviews as substantiating 
the theory, without any qualifying state- 
ments, 


OVERALL EVALUATION 


Having now reviewed much of the 
experimental work supporting cognitive 
dissonance theory, we conclude that, as 
a body of literature, it is downright dis- 
appointing. Too many studies have 
failed to stand up to close scrutiny. Yet 
it is also obvious that the dissonance 
framework has a seductive allure for 
many social scientists, an allure not 
possessed by the rather similar, but 
symbolically more complex, interpreta- 
tions by Heider (1958), Osgood and 
Tannenbaum (1955), or Newcomb 
(1953). 


Paradox of Simplicity 


The magical appeal of Festinger’s 
theory arises from its extreme sim- 
plicity both in formulation and in appli- 
cation. But in our review we have seen 
that this simplicity was generally decep- 
tive; in point of fact it often concealed 


TABLE 2 


SUMMARY or SOME EXPERIMENTS WITH INADEQUATE DESIGN AND ANALYSIS 


Study 


Criticism of design and analysis 


Allyn & Festinger (1961) 


Aronson & Mills (1959) 
Brehm (1956) 


Brehm (1960) 
Brehm & Cohen (1959a) 
Cohen (1959) 


Cohen, Terry, & Jones (1959) 


Ehrlich et al. (1957) 
Festinger & Carlsmith (1959) 
Mills (1958) 


Mills, Aronson, & Robinson 
(1959) 


Rosen (1961) 


No control group (repeat attitude test, no talk); inter- 
action significance not presented 


Overall significance not presented 


Maximum W = 225, but regression equation based on 
N = 557 and N = 534 


Overall significance not presented 
Overall significance not presented 


No control group (repeat attitude test, no counterinfor- 
mation); groups not equated on initial attitude 


No control group (repeat attitude test, no new informa- 
tion); groups not equated on initial attitude 


No control group (predecision car ad reading) 
Overall significance not presented 
Overall significance not presented 


No control group (preferences, but no decision); overall 
significance not presented 


No control group (preferences, but no decision) 
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a large number of confounded variables, 
Clearly much can be done to untangle 
this confounding of variables by careful 
experimental design. Nonetheless, there 
may still remain another problem more 
fundamental than this. In general, a 
cognitive dissonance interpretation of a 
social situation means that the relevant 
social factors can be condensed into two 
simple statements. To be sure, Fes- 
tinger does not say formally that a dis- 
sonance theory interpretation works 
only for two discrepant statements; but 
it is precisely because in practice he 
does so limit it that the theory has had 
so much acceptance. Which brings us 
now to the crux of the matter: is it 
really possible to reduce the essentials 
of a complex social situation to just two 
phrases? Reluctantly we must say “No.” 
To condense most complex social situa- 
tions into two, and only two, simple 
dissonant statements represents so great 
a level of abstraction that the model no 
longer bears any reasonable resemblance 
to reality. Indeed the experimenter is 
left thereby with such emasculated pre- 
dictors that he must perforce resort to 
a multiplicity of ad hoc hypotheses to 
account for unexpected findings. We see 
then that the most attractive feature of 
cognitive dissonance theory, its sim- 
Plicity, is in actual fact a self-defeating 
limitation, 

In conclusion, all of the considerations 
detailed above lead us to concur with 
Asch’s (1958) evaluation of the evi- 
dence for cognitive dissonance theory, 
and return once more a verdict of NOT 
PROVEN. 
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ELECTRICAL SELF-STIMULATION AND ITS 
THEORETICAL IMPLICATIONS? 


C. R. GALLISTEL 2 
Stanford University 


Evidence from electrical self-stimulation experiments is reviewed in 
the light of Deutsch’s structural theory of behavior. The theory ac- 
counts for all salient observations; viz., high drive, little or no satia- 
tion, rapid extinction, the need for “priming” at the start of a day’s 
trials, the difficulty in creating secondary reinforcers by using electrical 
stimulation of the brain as a primary reinforcer, the fact that different 
electrode placements interact with different natural drives, and the 


separability of the motivation and reinforcement processes. 


theories are briefly reviewed. 


In 1954 Olds and Milner discovered 
that electrical stimulation of certain 
areas in the brain was rewarding; i.e., 
animals would learn to seek out such 
stimulation. If, for example, rats could 
stimulate certain parts of the septum, 
the hypothalamus, and the tegmentum 
by pressing a lever in a Skinner box, 
they would learn to press the lever at 
quite a high rate for long periods of 
time. 

Many of the results of the research 
prompted by this discovery are confus- 
ing and somewhat paradoxical, as Zeigler 
(1957) noted when he first reviewed the 
literature. It has been found that al- 
though animals show a strong drive for 
electrical stimulation during experi- 
mental sessions, responses learned for 
this stimulation will extinguish with 
extreme rapidity. Likewise, the per- 
formance of animals in a maze where 
they are rewarded with brain shock will 
improve steadily during single experi- 
mental sessions with massed trials, yet 
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the performance declines markedly be- 
tween the last trial in one day’s session, 
and the first trial of the next day’s ses- 
sion. Finally, in spite of the evidence 
for strong drive and for high reward 
from electrical stimulation of the brain, 
most experimenters have apparently 
found it difficult to maintain respond- 
ing on VI schedules (variable intervals 
of time between reinforcements) with 
a mean higher than 16-30 seconds, or 
on FR schedules (fixed number of re- 
sponses between reinforcements) higher 
than 10:1, 

Many theorists have cited the phe- 
nomenon of self-stimulation as support 
for hedonic theories of motivation and 
reward, but until such theories are 
elaborated it will remain hard to judge 
whether the data do in fact support 
them. That these areas are in some way 
involved in reward (or better reinforce- 
ment) is obvious from the very fact that 
they produce learning. To say that the 
sensations produced by electrical stimu- 
lation in these areas are pleasurable does 
not explain how these sensations operate 
to produce learned behavior. Nor does 
it by itself explain the paradoxical 
results noted above. 

In order for the results of electrical 
stimulation studies to shed much light 
on the mechanisms of drive and reward, 
it is necessary to have some clearly 
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worked out theories about the nature of 
these mechanisms. Such theories must 
provide answers to two questions: (a) 
How does the electric shock act on the 
central nervous system to organize the 
animal’s behavior in such a way that it 
can reproduce the actions which led to 
the shock? (b) Whence comes the 
motivation which activates the response 
patterns established by the shock? 

This paper will evaluate the results of 
electrical self-stimulation in the light of 
a structural theory of behavior proposed 
by Deutsch (1960). The thecry is quite 
rigorous, and the data appear to accord 
well with its predictions. Only that part 
of the theory is given which is directly 
relevant to an explanation of the char- 
acteristics peculiar to the phenomenon 
of self-stimulation. 

The theory postulates that there are 
centers in the brain sensitive to bodily 
deficits or other motivating conditions, 
which centers normally generate motiva- 
tional excitation. This excitation flows 
from these centers along pathways to the 
sensory-motor units of the brain, which 
determine the behavior pattern that this 
excitation will energize. 

The motivational centers have sensory 
inputs which signal the arrival of goal 
objects such as food, water, sexual grati- 
fication, etc. These inputs have two ef- 
fects: First, they gradually inhibit the 
sensitivity of the motivational centers, 
rendering them less sensitive to the con- 
ditions which excite them. This ac- 
counts for the cessation of consum- 
matory activities. Second, when the 
center receives the goal signals, it sends 
out reinforcing impulses that hook up 
the neural units which mediated the 

actions that immediately preceded the 
goal signal. Thenceforth, motivational 
excitation from the primary center flows 
to these units via the connections estab- 
lished between them by the reinforcing 


impulses. Near the primary center the 


reinforcing impulses travel on pathways 
which run alongside the motivational 
pathways from that particular center. 

The theory assumes that electrode 
placements which produce self-stimu- 
lation are located in these motivation 
and reinforcement channels. A shock 
from such an electrode has two effects. 
It introduces artificial excitation into 
both the reinforcement and the motiva- 
tion pathways. The impulses in the rein- 
forcement pathways hook up the neural 
units which mediated the behavior lead- 
ing to the shock. The impulses in the 
motivation pathways excite these units 
so that the animal repeats the perform- 
ance. 


STRENGTH AND PERSISTENCE OF DRIVE 


It should be emphasized at the outset 
of the review that although in some areas 
the results consistently point in one 
direction, in many other areas the results 
obtained depend for the most part on the 
placement of the electrode and the in- 
tensity of the stimulating current. One 
of the advantages of the structural 
theory is that it seems to account for 
most, if not all, of this variability in the 
results. 

Following the structural theory, one 
would expect to see little or no satiation 
during continual electrical self-stimula- 
tion, since the reinforcing impulses do 
not themselves produce satiation. It is 
the incoming sensory impulses which 
inhibit the primary centers, thereby 
causing satiation. In general, the com- 
plete absence of satiation has been one 
of the most striking and consistent 
results of self-stimulation experiments. 
Olds (1958c) has found that rats will 
stimulate themselves continually for 24— 
48 hours until they drop from exhaus- 
tion. 

There is only one report of satiation 
effects in the literature. Olds (1958d) 
reported that there is a significant de- 
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cline in the rate of lever pressing in rats 
with telencephalic placements after they 
have continually stimulated themselves 
at a fairly high rate for 8 hours. How- 
ever, considering the rate at which they 
self-stimulate, 8 hours is an exceedingly 
long period. This may mean that the 
decline in rate is due to factors other 
than satiation. 

The fact that animals will self-stim- 
ulate till they drop from exhaustion in- 
dicates that the drive for this stimula- 
tion must be fairly strong. This has been 
borne out in experiments by Olds 
(1958c) where he demonstrated that 
rats would cross a much more highly 
charged grid for electrical stimulation of 
the brain than they would for food when 
they were 24 hours hungry. 


EXTINCTION 


In view of the evident strength of the 

drive, it is puzzling that extinction 
should come so quickly. Yet, a close 
look at the data from most experiments 
shows that a very sharp extinction curve 
is the rule. Seward, Uyeda, and Olds 
(1959) reported: 
Our results confirm earlier reports that this 
form of reinforcement produces steep extinc- 
tion curves. It would be of interest to find 
comparable data for a drive-maintained re- 
sponse. Unfortunately we have discovered no 
such data [p. 298; italics added]. 


These results are not so puzzling in 
the light of the structural theory. Since 
the motivation for the responses is arti- 
ficially injected by the shock itself, one 
might expect the responses to disappear 
soon after cessation of the shock, be- 
cause the motivation for them disap- 
peared and not because they were ex- 
tinguished, i.e., unlearned. This also 
accords with the common observation 
that after “extinction” the animals can 
be recalled to the lever by giving them 
a few shocks (Olds & Milner, 1954). 

In one case Olds (1956) has obtained 
something like a normal extinction curve. 


However, these animals were running 
under 24-hour food deprivation. The 
motivation in this case may have come 
partly from the hunger center. The in- 
teraction between artificial motivation 
induced by brain shock and natural 
motivation will be discussed later. 

Relevant here are data obtained by 
Seward, Uyeda, and Olds (1960) con- 
cerning massed and spaced trials. They 
found that performance during sessions 
of massed trials in a runway (with a 
lever in the goal box) improved mark- 
edly from trial to trial. There was, how- 
ever, no trial-to-trial improvement in 
performance when the trials were spaced 
15 minutes apart. The authors admitted 
that this is unusual: “Most studies on 
runway performance to food or water 
have reported no appreciable effect on 
acquisition due to spacing within the 
limits here used [p. 227]. They also 
found that the subjects extinguished 
quickly whether the lever was left in the 
goal box or withdrawn from it at the 
beginning of extinction, Finally, the 
tats given massed trials extinguished 
more slowly than the rats given spaced 
trials. Similarly, Olds ( 1956) found 
large decrements in performance in 
mazes from the last trial on one day to 
the first trial on the next. Nevertheless, 
taking each session as a whole, the 
animals showed day-to-day improve- 
ment. Here again, we might expect such 
results if we assume that decay of 
motivation over time and not reward is 
the key variable in spaced trials and be- 
tween sessions. 

In another experiment Olds (1958) 
found that his rats showed marked im- 
provement in the first trial running 
speed over the first 4 days. However, 
his rats were 18-24 hours hungry. In 
this case hunger may have supplied the 
motivation. Nonetheless, he noted that 
the first trial of each day was slower 
than the last trial of the preceding day; 
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whereas for the controls who were run- 
ning for food, the first trial was con- 
sistently faster than the last trial of the 
preceding day. The structural theory 
would predict that if the rats were run 
satiated, the day-to-day improvement in 
first-trial running speed would be much 
less marked and the decrement between 
the last trial of one day and the first 
trial on the next would be still larger. 

In support of this hypothesis one may 
note that experimenters have apparently 
found it very hard to maintain respond- 
ing on anything but small fixed-interval 
or fixed-ratio schedules. Sidman, Brady, 
Conrad, and Schulman (1955) reported 
schedules of VI=16 and FR=7:1. 
Brady and Conrad (1960), using a very 
high current, were able to keep their 
rats responding on a VI schedule with a 
60-second mean. This compares poorly 
with VI schedules having means of 5- 
10 minutes or more, which will maintain 
slow but steady responding when the 
reward is food or water. 

Monkeys seem able to maintain some- 
what higher schedules. Brady (1960) 
maintained responding in monkeys at 
FR=50:1. Brodie, Moreno, Malis, and 
Boren (1960), again using a very high 
current (3 milliamperes), found one 
monkey which would respond on FR= 
150:1. This is the largest FR in the 
literature. Four of his eight monkeys, 
however, would not respond on a fixed 
ratio higher than 20:1. No one has 
shown that a partial reinforcement 
schedule increases the number of trials 
to extinction. The structural theory 
would predict that partial reinforcement 
would decrease the number of trials to 
extinction instead of increasing them, as 
it does when natural reinforcers are 

used. 

An experiment by Howarth and 
Deutsch (1962) gives very strong sup- 
port to the hypothesis that extinction of 
responses induced by electrical stimula- 


tion is a function of drive decay and not 
of unlearning. They withdrew the lever 
from a Skinner box for periods of 2.5, 
5.0, 7.5, and 10.0 seconds at the start of 
extinction. They then replaced the lever 
and allowed the rats to press it to some 
criterion of extinction. They found that 
the rats gave the same number of 
presses to extinction that they would 
have given, had they been permitted 
to press the lever (for nonreinforcement) 
during the period when it was removed 
from the box. For example, the rats 
pressed the lever an average of 1.92 
times before reaching extinction after 7 
seconds of nonreinforced lever pressing, 
and they pressed it an average of 1.87 
times before reaching extinction after 
the lever had been removed from the 
box for 7 seconds at the start of extinc- 
tion. The number of responses to ex- 
tinction was purely a function of the 
time elapsed since the final brain shock 
and not of the number of unreinforced 
presses. i 


SECONDARY REINFORCEMENT 


From the structural theory one should 
normally not be able to demonstrate the 
creation of secondary reinforcing cues 
by the use of intracranial stimulation as 
a primary reward. In the structural 
theory, shock produces the motivation. 
Hence, no shock, no response; unless 
some natural motivation is also present. 
This is in contrast to what one might 
expect from some sort of pleasure or 
incentive theory. The incentive theories 
might account for quick extinction, etc., 
by pointing out that the incentive was 
absent. One ought to be able to assume; 
though, that cue stimuli which had been 
associated with the incentive would serve 
to produce the behavior. 

The evidence here is inconclusive. In 
general, though, it has been very difficult 
to show secondary reinforcement phe- 
nomena. Seward, Uyeda, and Olds 
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(1959) paired a light with all lever 
presses that produced shock, but found 
that the light did not increase the num- 
ber of trials to extinction once the shock 
had stopped. This was probably not due 
to failure to discriminate or take note of 
the light, since it has been shown that 
rats are capable of discriminating tones 
concurrently with rewarding intracranial 
stimulation (Beer & Valenstein, 1960). 

Stein (1958) did obtain some evidence 
of secondary reinforcement. He placed 
the rats in a Skinner box with two 
levers, one of which produced a tone and 
one of which did not. He recorded their 
operant rate of pressing each lever for 1 
hour on 6 successive days. Then both 
levers were removed and the rats were 
given 100 brain shocks paired with the 
tone. The levers were again presented 
and the rate of pressing on each was 
again recorded. The brain-shock-posi- 
tive group showed a significant prefer- 
ence for the tone linked lever and a 
significant increase in their absolute rate 
of pressing the tone lever. A brain- 
shock-neutral group showed little or no 
change in either preference or rate. 

It should be noted, however, that the 
increased rate of pressing the tone lever 
in the experimental group is insignifi- 
cant when compared with their rate of 
Pressing for electrical stimulation itself. 
In a posttest to distinguish between the 
control and the experimental rats (the 
brain-shock-positive ones), the latter 
pressed a lever for electrical stimulation 
at a minimum of 9 times a minute or 540 
times an hour. This compares poorly 
with the rate of 35 responses an hour 
obtained for the tone alone. 

Since histological verifications and the 
details of pre-experimental conditions 
such as feeding are not given, one can- 
not rule out the presence of some natural 
Motivation which the structural theory 
must assume. The theory would predict 
that the outcome of a secondary rein- 


forcement experiment such as this one 
would vary with the manipulation of 
natural drives. 


RELATIONSHIP BETWEEN EXPERIMEN- 
TALLY INDUCED AND NoRMAL 
MOTIVATION 


Mapping studies (Olds, 1960b; Olds, 
Travis, & Schwing, 1960) have shown 
electrical self-stimulation phenomena 
from a number of placements in the 
septum, the telencephalon, the hypo- 
thalamus, and the tegmentum. Olds 
thought he found two systems where 
electrical self-stimulation phenomena are 
present. The main system—the one 
which produces the highest rates of self- 
stimulation—is in the medial forebrain 
bundle and includes the medial anterior 
and posterior hypothalamus and the 
ventromedial regions of the telenceph- 
alon and the tegmentum. The second 
system, which produces lower rates. 
winds from the caudate and septal areas 
through the dorsal thalamus and into 
the tectum. 

In the main or primary area, rate was 
a positive linear function of the stimulus 
intensity. In the secondary area rate 
was a square or fluctuating function of 
the stimulus intensity; i.e., as the stim- 
ulus intensity was increased the rate 
either went up and down or it quickly 
leveled off at a maximum. Olds the- 
orized that placements producing posi- 
tive linear correlation between rate and 
intensity were surrounded by reward 
cells for some distance so that the cur- 
rent spread at higher stimulus inten- 
sities simply affected more and more 
reward cells; that placements producing 
square functions were surrounded by a 
thin layer of reward cells with only 
neutral cells beyond these; that place- 
ments producing fluctuating functions 
were surrounded by concentric layers 
consisting alternately of reward and 
avoidance cells. Reynolds (1958) in a 
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similar study’ found only one pattern, 
namely, a steady increase in rate up to 
an “optimum” voltage followed by a 
steady decrease. 

Hodos and Valenstein (1960) have 
pointed out, however, that the second 
system may produce lower rates simply 
because shocks in these areas also cause 
interfering motor effects such as twitches 
and crouching. They emphasize that the 

rate of pressing is not a good measure of 
the size or intensity of the reward. They 
gave their rats a two-lever choice situa- 
tion in which they could obtain mod- 
erately strong stimulation from a septal 
electrode or low stimulation from a 
hypothalamic electrode. Their rats pre- 
ferred the lever producing the stronger 
septal stimulation although the rate of 
pressing for the septal stimulation was 
slower than that for the hypothalamic 
stimulation. 

Since the motivational channels in the 
brain must spread out from the motiva- 
tional centers thought to be located in 
and around the hypothalamus, the 
results of the mapping experiments ac- 
cord with what one might expect on the 
basis of the structural theory. One would 
also expect from the structural theory 
that the results obtained from different 
placements would be differentially af- 
fected by natural motivational states, 
The structural theory assumes separate 
reward and motivation channels for 
hunger, thirst, sex, fear, etc. Therefore, 
the performances for electrical stimula- 
tion obtained from one placement should 
be influenced by one of these naturally 
motivating states but not by the others, 
unless the electrodes were so large as 
to affect channels from two or more 
centers. 

A number of studies give evidence 
that this is in fact the case. Brady, 

Boren, Conrad, and Sidman (1957) 
found that with certain placements, the 
rate of responding for electrical stimu- 
lation was considerably higher when the 
animals were 18-24 hours hungry and 


thirsty. Unfortunately they did not 
separate the conditions of hunger and 
thirst. Hoebel and Teitelbaum (1962) 
implanted self-stimulation electrodes in 
the lateral hypothalamus. They found 
that ablating or anesthetizing the medial 
hypothalamus increased both eating and 
rates of electrical self-stimulation. Elec- 
trically stimulating the medial hypo- 
thalamus decreased both food consump- 
tion and rates of self-stimulation. When 
they applied the techniques of anestheti- 
zation, ablation, and electrical stimula- 
tion to certain parts of the lateral hypo- 
thalamus, they obtained the exact 
converse of the above results. 

Olds (1958b), in an experiment on 16 
castrated rats, found that both hunger 
and androgen levels influenced rates of 
self-stimulation. Further, he found that 
if hunger increased the rate of self-stim- 
ulation from one placement, then high 
androgen levels decreased it, and vice 
versa. This is what we might expect, 
since a high nonrelevant drive would in- 
terfere with a placement which was in- 
serting artificial motivation into another 
drive system, 

Deutsch and Howarth (1962) have 
demonstrated an interesting relationship 
between fear and self-stimulation from 
tegmental placements. They trained 10 
rats to press a lever for electrical stim- 
ulation and then “extinguished” the re- 
sponse by holding the rats away from 
the lever for a minute or two. After a 
5-minute pause during which the rats 
generally did nothing, they frightened 
them with a loud buzzer or a mild shock 
to the feet. The five tegmental animals 
immediately returned to the lever and 
began pressing it. They sometimes kept 
on pressing for several minutes. The 
other five rats did not return to the lever- 
This indicates that the tegmentum con- 
tains pathways for the motivational ex- 
citation arising from fear, and that a 
habit learned for electrical stimulation 
of the brain (ESB) can be reactivated 
by naturally arising motivation. 
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It is interesting to note in this con- 
nection that Olds (1958c) reported an 
avoidance area just beneath the teg- 
mentum (see also Delgado, Roberts, & 
Miller, 1954). This may be a fear cen- 
ter. The hypothesis that the tegmentum 
contains fear pathways originating in or 
near this area may shed light on a puz- 
zling result that Olds and Milner (1954) 
reported in their first paper. They found 
that one of their tegmental placements 
had an exceedingly high rate of operant 
responding even when it was not being 
reinforced by ESB: 

Finally there is an electrode . . . between 
the red nucleus and the posterior commissure. 
It produced an acquisition score of 77%, but 
an extinction score of 81%. This must be a 


rewarding placement, but the high extinction 
score makes it difficult to interpret [p. 23]. 


It may well have been that the animal 
was quite frightened, in which case this 
dovetails nicely with the results ob- 
tained by Deutsch and Howarth. 
Finally, Brady and Conrad (1960) 
ran an experiment testing for anxiety 
effects during self-stimulation. They 
sounded a tone for 5 seconds at the end 
of which they delivered a painful shock 
to the animal’s feet. The tone thus be- 
comes a conditioned stimulus (CS). 
Normally, sounding the CS will abruptly 
halt responding in a Skinner box until 
after the unconditioned stimulus (UCS). 
Their results, however, were very incon- 
sistent. They ran the experiment on six 
tats, two cats, and four monkeys. They 
found no anxiety effect in the rats, but 
both cats demonstrated it. Some mon- 
keys showed the anxiety affect and some 
did not. In one monkey they found no 
attenuation of responding from one 
placement (medial forebrain bundle), 
but marked attenuation from another 
(anterior thalamus). It is tempting to 
Speculate that the placements where no 
attenuation was found were in fear path- 
ways. Be that as it may, the authors 


(Brady & Conrad, 1960) concluded that 
their findings 


would seem to provide strong support for the 
conclusion that the reinforcing properties of 
electrical stimulation of different parts of the 
brain may derive from quite different behav- 
ioral and physiological effects [p. 35]. 

The data, then, clearly support the 
hypothesis that there are channels of 
reward and motivation specific to the 
different natural drive states. But no 
one has yet mapped out what areas are 
related to which drives. 


REWARD AS A SEPARATE PROCESS 


There is good evidence to show that 
electrical stimulation of certain areas is 
motivating as well as rewarding. But 
what proof is there that reward and mo- 
tivation are separate processes? 

There is little direct experimental evi- 
dence on this score. Nevertheless, con- 
sideration of the indirect evidence makes 
this the most plausible conclusion. 
Thirst, hunger, sex, and fear motivation 
have been artificially introduced by elec- 
trical and chemical stimulation without 
any observed concomitant rewarding 
effects (Delgado & Anand, 1953; 
Larsson, 1954; Miller, 1957; Smith, 
1956). In some cases this is probably 
due to failure to look for or test the 
rewarding properties. Margules and 
Olds (1962), using rats, found that all 
placements which produced increased 
eating also produced high rates of self- 
stimulation. It is clear, however, that 
in the case of sex and fear, motivation 
can be artificially produced by electrical 
stimulation without any concomitant re- 
warding effects (Miller, 1961). Hunger 
and thirst can also be produced by 
single injections of NaCl (Hoebel & 
Teitelbaum, 1962; Miller, 1961). The 
motivating effect of these injections is 
marked and fairly long-lasting. Their 
rewarding effect, if any, has never been 
observed. Since some electrode place- 
ments clearly produce both motivation 
and reward, it seems plausible to argue 
that they are two different processes. 
In the face of the evidence it would seem 
highly unlikely that the rewarding 
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effects of electrical stimulation could be 
accounted for by a drive reduction hy- 
pothesis. On the contrary, electrical 
stimulation from self-stimulation place- 
ments appears to increase, not to reduce, 
the drive for more. 

The only direct experimental evidence 
comes from an as yet unpublished ex- 
periment by Deutsch, Howarth, Ball, 
and Deutsch (1962). They reasoned 
that if electrical self-stimulation place- 
ments are activating two separate proc- 
esses or systems—a motivation system 
and a reward system—then the thresh- 
olds of electrical activation for the two 
systems might be different. A prelimi- 
nary experiment indicated that the re- 
ward threshold was higher than the 
motivation threshold. It indicated that 
a current intensity just below the gen- 
eral threshold for electrical self-stimu- 
lation phenomena might be above the 
motivation threshold but below the re- 
inforcement threshold. Consequently, 
animals stimulated by this current would 
show motivation by persisting in an al- 
ready learned response but would not 
show any new learning. 

To test this they compared perform- 
ance under four different conditions of 
stimulation after an initial period of 
training on a lever pressing habit for 
electrical stimulation. Two conditions 
involved subthreshold stimulation; i.e., 
stimulation at intensities lower than those 
which were necessary to induce learning, 
In the first of these the stimulation was 
contingent upon a lever press—the rat 
received stimulation each time it pressed 
the lever and not otherwise. In the sec- 
ond condition stimulation was presented 
regularly by a pulse generator at a rate 
approximately equal to the rat’s normal 
rate of responding for suprathreshold 
stimulation. Thus the animal received 
regular stimulation whether it pressed 
the lever or not. 

The prediction here was that when the 

rats were receiving regular but not nec- 
essarily contingent stimulation, they 


would respond more frequently than 
when stimulation was contingent upon 
lever pressing. Since electrically in- 
duced motivational excitation decays 
rapidly, any slight pause in responding 
would permit it to drop nearly to zero 
when stimulation is contingent upon 
lever pressing. Once motivation had 
dropped to zero, there would be little 
chance of restoring it because further 
stimulation is contingent upon pressing 
the lever and the rat would no longer be 
motivated to do so. However, if motiva- 
tion is regularly replenished regardless 
of whether the animal is responding or 
not, small pauses in responding will not 
result in complete cessation of motiva- 
tion. Seven out of eight rats did in fact 
respond more frequently while receiving 
regular noncontingent subthreshold 
stimulation than while receiving con- 
tingent subthreshold stimulation. 

The second two conditions were the 
same as the first two except that stimu- 
lation was at threshold intensity. In 
this case the converse of the above re- 
sults was expected. When threshold 
stimulation is given regularly by a pulse 
generator, it will be to a greater or lesser 
extent noncontingent upon the lever 
press; hence it will tend to reinforce 
either competing or at least interfering 
responses or “superstitions.” On the 
other hand, contingent stimulation re- 
inforces only the correct response, and, 
at threshold, it is strong enough to 
motivate a rapid rate of responding with 
little likelihood of complete decay of 
motivation during short pauses. Con- 
sequently, at threshold intensities re- 
sponding should be more rapid with 
contingent stimulation than with non- 
contingent stimulation. This proved to 
be the case for all rats tested. 


REWARD-AVOIDANCE PLACEMENTS 


Several experiments have shown that 
animals find long shocks from some 
placements unpleasant, even though 
short shocks from these placements are 
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rewarding. It is difficult to say just 
what theoretical significance, if any, 
these results may have. 

Roberts (1958) first reported the 
phenomenon. He found that he could 
train cats to learn the correct turn in 
a T maze in order to escape a continuous 
brain shock. When, however, he put the 
cats in an escape box and opened the 
door 15 seconds before giving them a 
continuous shock, they would not learn 
to leave the box in order to avoid the 
shock, although they did learn to escape 
from the box in order to terminate the 
shock. Roberts hypothesized that the 
onset of the shock was rewarding. To 
test this, he put the animals in a self- 
stimulation Skinner box. They did in- 
deed learn to press the lever. 

Miller (1957a) reported an experi- 
ment by Kirschner in which he taught 
rats to press one lever to produce a brain 
shock and another lever to terminate the 
shock. 

The most complete study of the phe- 
nomenon was done by Bower and Miller 
(1958). They clearly confirmed initial 
hypotheses that continuous shocks from 
some placements gradually become un- 
pleasant even though they are at first 
rewarding. As a start, they replicated 
Roberts’ experiments and further dem- 
onstrated that rats can learn to leave a 
box in order to avoid an unpleasant 
brain shock provided that the placement 
is not rewarding for short shocks. They 
provided additional evidence by divid- 
ing their rats into two groups: those 
which showed reward-avoidance effects 
and those which showed only reward ef- 
fects. They found that the reward-avoid- 
ance placements had a shorter mean 
duration of presses than did the pure 
reward placements when the brain shock 
continued as long as the lever was kept 
down. They also tested the rats under 
conditions where each bar press pro- 
duced a 3-second shock. They found 
that the rate of pressing for the reward- 
avoidance placements decreased sharply, 


whereas the pure reward placements 
showed no decrease in their rate of 
pressing. Finally, they did careful histo- 
logical work which showed that the re- 
ward-avoidance placements were in the 
middle to anterior portions of the medial 
forebrain bundle and that the fornix 
was not involved. Placements in the far 
posterior parts of the medial forebrain 
bundle and other parts of the hypothala- 
mus were purely rewarding. Olds and 
Peretz (1960) found that most of their 
reward-avoidance placements were lo- 
cated just below the medial lemniscus. 
Olds (1960a) reports reward-avoidance 
placements in the medial hypothalamus 
and the lower tegmentum. 

Why placements in the forward part 
of the medial forebrain bundle produce 
unpleasant effects during long shocks is 
not known. The shock may spread to 
avoidance centers or it may spread to 
afferent pain fibers. Bower and Miller 
(1958) reported that some of their re- 
ward-avoidance placements showed 
clear signs of pain when they received 
long shocks. It may be also that the 
shock slowly affects the pain receptors 
in the blood vessels of the brain (if such 
receptors do exist). 


Druc STUDIES AND MISCELLANEA 


Several experimenters have studied 
the effects of drugs on self-stimulation 
behavior (Miller, 1957b; Olds, 1958, 
1961; Olds, Killam, & Eiduson, 1957). 
Since the effects of the drugs are them- 
selves under investigation and still 
poorly understood, the results of these 
studies are more relevant to the litera- 
ture on psychotropic drugs than to the 
literature which is primarily concerned 
with the properties of self-stimulation. 

Ward (1959) did a study to deter- 
mine which properties of the electrical 
stimulus were crucial in determining the 
threshold for self-stimulation and the 
rate of responding from septal place- 
ments. Using a sine wave stimulus of 
constant current (.09 milliampere) and 
duration (.9 second per press), he found 
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that the alternation frequency of the 
current had little effect on rate of press- 
ing in the range from 10 to 1,000 cycles 
per second; above and below these fre- 
quencies the rate fell off sharply. In the 
threshold experiments he used a stimu- 
lus of .2 second duration in the form of 
biphasic rectangular pulses; the length, 
frequency, and intensity of which he 
varied systematically. If we regard the 
threshold as a constant C, his results 
can be summarized by the equation 
I*t=C, in which I equals the intensity of 
the current and t equals the length of 
the pulse.” The number of pulses did 
not materially affect the threshold in 
the range from 20 to 100 pulses per 
second. At 10 pulses per second he could 
not get his rats to maintain responding. 
Albino and Lucas (1962) claimed to 
have demonstrated that stimulation at 
one positive electrode outside the sep- 
tum immediately prior to septal stimula- 
tion facilitates responding for the septal 
stimulation. The presentation of their 
data, however, makes it impossible to 
decide whether the data justify such a 
conclusion or not. 


OTHER THEORIES 


There is a paucity of rigorous theories 
attempting to deal with the complete 
range and peculiar characteristics of the 
results from self-stimulation experi- 
ments. 

Several theorists, especially those who 
incline toward drive reduction or homeo- 
static theories of learning, have sug- 
gested ways in which the rewarding 
effect of the stimulation could be ex- 
plained. Prima facie, this rewarding 
effect is difficult to account for in terms 

of drive reduction. Brady (1958); 
Miller (1961a); and Nielson, Doty, and 
Rutledge (1958) have all suggested that 
the electrical stimulus may reduce drive 
by temporarily obliterating the drive 
3I am indebted to C. I. Howarth for hav- 


ing pointed out this simple mathematical sum- 
mary of the findings. 


stimuli in the central nervous system— 
or, in Miller’s words: 

[by] temporarily blocking out motivational 
sources of stimulation. Such a functional 
elimination of motivational stimuli might oc- 
cur during unconsciousness produced by a 
petit mal seizure [p. 576]. 

Nielson et al. feel strengthened in this 
view by the fact that the current inten- 
sities required to produce self-stimula- 
tion are considerably above those re- 
quired for perception as conditioned 
stimuli. 

This suggestion does not explain why 
many placements which produce evident 
convulsions do not produce self-stimula- 
tion, nor does it explain why a rat that a 
few minutes ago was sleeping in its cage 
will evince such a determination to re- 
turn to a self-stimulation lever which it 
has been pulled away from after being 
shaped to press it. In other words, this 
does not explain the drive arousing 
effects of the electrical stimulus. No one 
has attempted to account for the other 
peculiarities of self-stimulation behavior 
reviewed above on the basis of this sug- 
gestion. 


Miller (1961a) has also suggested 
that: 


In some cases, the reward phenomenon (pro- 
duced by subcortical stimulation) may be 
analogous to addiction, with the effects of 
one stimulation tending to remove the un- 


pleasant aftereffects of the preceding one 
[p. 577]. 


As Miller points out, this explanation 
would only apply in some cases, since it 
has been demonstrated that stimulation 
from some placements apparently has 
no aversive component (Brodie et al., 
1960). Nor, on the face of it, can this 
suggestion account for the interaction 
between self-stimulation and the vari- 
ous natural drives, 

Mowrer (1960) suggested that self- 


stimulation placements are in areas 
wherein: 


the cessation of drive (which homeostatic 
theory identifies with reinforcement) becomes 
translated from mere stimulation termination 
over into a more positive, more “active” state 


ELECTRICAL SELF-STIMULATION 33 


of some kind, and that it is the latter which, 
in the final analysis, is the agent that fixates 
response tendencies [p. 207]. 


This is not implausible as far as it goes, 
but it does not seem to account for much 
of the data beyond the bare fact that 
stimulation from such placements is re- 
warding. 

In an apparent extension of his earlier 
theory (Olds, 1955), Olds (1958a) has 
proposed that neural traces of a behavior 
sequence are laid down in the nervous 
system by their mere occurrence. Then, 
if certain reward areas are electrically 
stimulated at the end of the sequence, 
they will give the trace in question “af- 
ferent control of the neural elements 
which provide for the release of organ- 
ized traces into overt behavior.” The 
behavior will reappear when “some ade- 
quate representation of the environ- 
mental stimuli which preceded the be- 
havior ... [reorganizes] the whole 
trace [pp. 259-260].” 

In its present form the theory does 
not explain the many motivational prop- 
erties of electrical stimulation in reward- 
ing areas. For example, it does not seem 
able to explain why animals so fre- 
quently require “priming” at the start 
of a day’s trials, nor does it explain the 
rapid “extinction” of the responses 
learned for electrical stimulation. 

Of the theories so far proposed, the 
structural theory appears most adequate 
to account for the complete range of 
results from studies on the phenomenon 
of electrical self-stimulation. The suc- 
cess of the theory at this more physio- 
logical level is especially noteworthy, 
since it was originally based chiefly on 
behavioral data from maze and Skinner 
box studies. 
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The various meanings of the term adaptation are examined, from the 
evolutionary context to the physiological. Of the 2 main uses in behav- 
ioral studies the 1st refers to the learned adjustments made in the organ- 
ism’s lifetime, and the 2nd to processes of response decrement, as 
in sensory adaptation. The former are judged adaptive by certain 
evaluative criteria, whereas the latter can be experimentally analyzed. 
These uses are considered broadly in relation to the categories of 
ontogenetic and phylogenetic adaptations, and specifically in relation 


to current views on learning. 


The evolution of life has not been 
random, but oriented; and the major 
nonrandom orienting factor in this 
process is reasonably indentified as 
adaptation. Simpson (1958) therefore 
asserts that accounting for adaptation 
is the one central, inescapable problem 
of biology. But within the different 
biological sciences the evolutionary use 
of adaptation is modified and diluted: 
the term takes on various meanings and 
may cover many diverse kinds of 
processes. 

Used loosely, adaptation refers to an 
enormous array of phenomena, ranging 
across a time span of millions of evolu- 
tionary years to a few minutes in the 
case of “adaptation” of sensory proc- 
esses. In psychology alone, many refer- 
ences are made to adaptive behavior, 
processes of adaptation, habituation, 
and accommodation. Is there anything 
in common between these terms, or be- 
tween these and the evolutionary study 
of adaptation? 

It is time to inquire into this concept 
from a behavioral orientation. The pur- 
pose of this paper is, therefore, to 
survey the biological meaning of adapta- 
tion, first from an evolutionary and then 
from a behavioral point of view; next, 
to consider some experimental studies of 
So-called adaptation processes in psy- 
chology and physiology. Finally, the 
use of terms will be examined, and some 


implications for psychological thought 
discussed. 


Adaptation and Evolution 


It is an essential feature of evolution- 
ary theory that changes occur only when 
and only as long as they are adaptive to 
the environment, and the unraveling of 
this idea represents one of the most 
fascinating and baffling problems of 
modern analytical biology. In this 
analysis, four major factors have been 
recognized: the nature of the external 
environment; development of morpho- 
logical characteristics and physiological 
organization; behavior of the organism 
in such activities as food seeking, 
defense, and reproduction (especially in 
relation to natural selection); and 
genetics—the vast pool of genetic alter- 
natives available in a population. These 
factors do not act singly, but interact in 
many complex ways. 

The development of genetics over the 
past decades has made a significant con- 
tribution to current thought. We are 
now sufficiently free from Aristotelian 
teleology and well versed in genetics not 
to think of biological end states in terms 
of intelligent designers, but rather to 
acknowledge the opportunistic nature 
of adaptation; to see it as a historical 
process contingent upon the interplay 
of physical, ecological, and constitu- 
factors. In this process, dif- 
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ferent mechanisms such as the gor- 
geous plumage of birds, crests, color- 
ings, and odors can all come to serve 
the same biological end of sexual attrac- 
tion. Pittendrigh (1958) expressed this 
particularly lucidly, describing adapta- 
tion as a 


patchwork of makeshifts pieced together, as it 
were, from what was available when oppor- 
tunity knocked, and accepted in the hind- 
sight, not the foresight of natural selection 
{p. 400]. 


Modern theories have shifted the 
emphasis away from Darwin’s “struggle 
for survival.” Although there is more or 
less unanimous agreement nowadays that 
his principle of natural selection is fun- 
damental in evolutionary change, the 
emphasis is not so much on individual 
survival as on the successful reproduc- 
tion and adaptation of populations 
(Simpson, 1949, 1958). 

Increasing reproductive efficiency is 
one of the ways in which evolutionary 
Progress, or improved quality of adapta- 
tions, can be demonstrated. The whole 
notion of evolutionary progress is, how- 
ever, highly speculative, and it is un- 
certain what criteria should be applied. 
The various current ideas on specializa- 
tion of adaptations, progressive inde- 
pendence from the environment, the 
ability to cope with a greater variety of 
environmental factors, etc., have been 
treated in many excellent texts, most 
notably by Herrick (1956) and Simpson 
(1953). 


Adaptation and Behavior 


Evolutionary theories are largely 
based upon historical reconstructions of 
earlier forms of life. In the ontogenetic 
context, the term adaptation refers to 
current adjustments. Nevertheless, cer- 
tain principles common to both phylog- 
eny and ontogeny can still be discerned. 
Both refer to end directed systems (how- 
ever ill defined this phrase might be), 


both accept that there are alternative 
means of realizing or attaining their 
goals, and both set up certain criteria 
of adaptation such as successful adjust- 
ment to changing environmental condi- 
tions. A few thinkers, like Sommerhoff 
(1950), take the extreme view that all 
adaptations incorporate the same kinds 
of principles, and therefore can be 
treated within the same general formula. 

There seem, however, to be certain 
clear-cut points of difference. The en- 
vironmental background to which the 
individual adjusts in ontogenetic adap- 
tations may be physical or social. In any 
event the adaptation is specific to the 
individual and is not genetically trans- 
mitted. Its medium is the central nerv- 
ous system, whereas in phylogeny it is 
genetically determined. 

The truth of the matter would seem 
to be more complicated than either 
extreme. On the one hand it is difficult 
to see how Sommerhoff’s formula can 
encompass all the enormous variety of 
adaptation processes and ever be quanti- 
tative. On the other hand, some con- 
tinuity, some interaction, clearly occurs 
between phylogenetic and ontogenetic 
adaptations. 

The term “adaptive behavior” in the 
behavioral context is far less frequent 
and far less meaningful than it is in 
evolutionary thought. One reason is that 
the preoccupation with functional rela- 
tions and general laws of learning neces- 
sarily demands a strictly controlled en- 
vironment which allows the animal little 
opportunity for demonstrating or devel- 
oping new molar patterns of behavior- 

Most learning theories admit (albeit 
cursorily) that behavior is based on i 
herited adaptation patterns which have 
developed on the pragmatic basis of 
survival, and that it is their associatio™ 
with other stimuli and their satisfactio” 
which are primary factors in shaping 
behavior. Tt is further recognized that 
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many of the drives which influence be- 
havior are “acquired” as a result of in- 
dividual experience; and even that in- 
creased competence in dealing with the 
environment can be rewarding (White, 
1959). 

While the capacity for learning is 
inherited, its actual exercise is entirely 
dependent upon the organism’s interac- 
tion with its environment and the op- 
portunities provided by it. Lack of op- 
portunity can result in behavior which is 
undeveloped or “inappropriate”; con- 
versely, a rich and intensely varying 
environment can tax the “adaptive” 
qualities of the organism and render its 
behavior “maladaptive” or “malad- 
justed.” In this way, evaluative state- 
ments are built up such as “adjustment 
is an efficient relationship between an 
individual and his. environment which 
enables him to live comfortably and 
without strain and conflict.” 

These views both involve the notion 
of criteria: (a) that biological adapta- 
tions such as eating and mating behavior 
can become more efficient or appropriate 
as a result of experience, and (b) that 
some forms of learned behavior (e.g., 
social) can be better adapted to the 
existing conditions than others. 

Evaluative Criteria of Adaptive Be- 
havior. These, like the ones mentioned 
earlier in connection with the quality of 
phylogenetic adaptations, are also highly 
speculative. To a certain extent, “effi- 
ciency” of eating and mating can be de- 
scribed in reasonably objective terms 
such as speed, directness of response, 
etc., where an individual’s performance 
can be judged in relation to a behavioral 
norm or mean. This norm, combined 
with the experimenter’s knowledge of 
the goal of the animal—escape from 
Shock or procurement of food-—leads 
Somewhat inevitably towards a concept 
of behavior which is more or less adap- 
tive. But evaluative criteria which refer 


to the competence or efficacy of, for 
example, human social behavior, are 
extremely difficult to specify. 

In one widely used sense, maladap- 
tive behavior is that which appears 
as an extreme variant of a population 
norm. But the term has other con- 
notations. One of the most persistent 
involves the notion of energy balance 
and expenditure. 

Concept of Energy in Adaptive Be- 
havior. It has often been specifically 
stated that an essential feature of 
adaptation is the delimitation of energy 
output to the minimum capable of 
meeting the requirements of a situa- 
tion; and Thorpe (1956), for example, 
expresses the view that habituation is 
“economical” in that it saves the. ani- 
mal’s energies. This links up with the 
view that maladaptive behavior results 
in a depletion of the organism’s energy 
resources, and more generally with the 
common belief that normal behavior rep- 
resents a kind of “biological equilib- 
rium” or “energy balance” (Freeman, 
1948). 

Such concepts must be used with cau- 
tion, for it is tempting to ascribe to be- 
havioral energy the properties of phys- 
ical energy; in the resulting confusion, 
more analytical studies of behavior may 
be hindered (Hinde, 1960a). 

Energy models play a less obvious 
part in physiological theories, but there 
are many implicit references to energy 
processes. Cannon’s emergency theory, 
for example, was fundamentally con- 
cerned with the redistribution of bodily 
resources to make energy rapidly avail- 
able for muscular effort, and Selye’s 
(1956) theory of the stress response and 
“adaptation energy” involves the belief 
that no living organism can be main- 
tained continuously in a state of alarm 
because the drain upon internal energy 
resources is too great for it to cope with. 

Tt seems hardly necessary to labor the 
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point that psychological factors can alter 
the pattern of behavior without any 
known change in hypothetical stores of 
energy (Kubie, 1947). We are too far 
from understanding general or adaptive 
behavior in terms of molecular patterns 
of “energy” changes in the organism to 
engage in sharp controversy over this 
point. But Hinde’s warning must stand: 
there is no easy way out of analyzing 
behavior’s intricacies by adopting facile 
models of any kind. 


Adaptation as Response Decrement 


The next few sections will deal with 
some of the processes, labeled adapta- 
tion processes, which have been studied 
in detail; it will be noted that some of 
these processes are little, if at all, mod- 
ified by learning—others considerably 
so. 
Reflex Adaptation. Many kinds of 

responses have been studied over a wide 

range of animals (see reviews by Harris, 

1943; Thorpe, 1956) from the now 

classical studies of Humphrey (1933) 

on the land snail and of Dodge (1931) 

on the knee jerk, eyeblink, and nystag- 

mus responses, to more recent work on 
abdominal, biceps, and patellar reflexes 

(Hagbarth & Kugelberg, 1958; Lehner, 

1941) and on the pupillary reflex to 

light (Cuppers, 1954; Lowenstein & 
Loewenfeld, 1951). 

Many workers have studied the 
habituation of the widespread behav- 
ioral autonomic, muscular, and electro- 
encephalogram components of orienting, 
alarm, or startle responses (Davis, 
Buchwald, & Frankmann, 1955; Dyk- 
man, Reese, Galbrecht, & Thomasson, 
1959; Hinde, 1960b; Landis & Hunt, 
1939; Scholander, 1961). The immedi- 
ate autonomic and skeletal muscle reac- 
tion to a stimulus is diffuse, but after 
only a few repetitions of the stimulus 
this activity becomes less widespread, 
lower in amplitude, and shorter in dura- 


tion until eventually there may be a 
complete failure to respond. In such 
cases, it seems as if the animal has 
learned not to respond. 

There are also many recent electro- 
physiological studies on adaptation proc- 
esses (Hernandez-Peon, Scherrer, & 
Velasco, 1956; Sharpless & Jasper, 
1956). All agree on the kinds of factors 
which affect habituation, namely: (a) 
on the stimulus side—stimulus modal- 
ity, intensity, duration, and inter- 
stimulus interval; and (b) on the re- 
sponse side—within-organism factors 
such as level of wakefulness or fatigue, 
intake of drugs, and previous adapta- 
tion level. Recovery of the habituated 
response can occur under certain con- 
ditions which include the lapse of time 
(spontaneous recovery), and the intro- 
duction of an extraneous stimulus or any 
sudden change in the total experimental 
situation (disinhibition). 

Sensory Adaptation. All receptors 
manifest properties of adaptation, which 
means that a constant repetitive stimu- 
lus set off a series of receptor responses 
which gradually slows in time and even- 
tually ceases. But the term adaptation 
in sensory functions has many meanings 
(Jenkins, 1951). For example, it may 
refer to the “physiological zero” of the 
subjective temperature scale, a kind of 
frame-of-reference concept which has 
been developed by Helson (1948) in his 
“adaptation-level” theory of perception. 
Or, as in the case of dark adaptation, it 
may refer to a process of recovery, ie. 
increasing sensitivity. 

The results of sensory research show 
that with careful control both of stimu- 
lus characteristics (e.g., intensity, dura- 
tion, and wave length of light) and of 
receptor conditions (pupil size, retinal 
localization, etc.) rather regular and 
reproducible adaptation curves can be 
obtained. A number of restricted laws 
and theories have been developed to 2° 
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count for specific forms of sensory 
adaptation, such as the well-known 
photochemical theories of visual adapta- 
tion (Hecht, 1937). 

There are no quantitative laws which 
can be generally applied to all forms of 
sensory adaptation—understandably so 
in view of the highly specific nature of 
receptors. From time to time, however, 
generalizations about sensory adapta- 
tion appear in the literature. One is 
that the temporal curve of adaptation 
is negatively accelerated, proceeding 
rapidly at first and gradually slowing; 
another is that the time required for 
adaptation increases with the intensity 
of the adapting stimulus; a third that 
rate of adaptation decreases as the num- 
ber of receptors stimulated increases 
(Osgood, 1953). 

It should be emphasized, however, 
that there exist not one but a great many 
adaptation curves of different shape and 
level, depending upon the conditions 
under which they were obtained. This 
comment should not be taken as imply- 
ing that generalizations of this kind are 
unattainable, rather that they must be 
accompanied by some statement as to 
the range of conditions under which 
they hold. 


Differentiation of Adaptation Processes 


It seems to be commonly accepted 
that the categories of adaptation and 
habituation are different, although there 
have been isolated suggestions that some 
neural factor, such as increased thres- 
hold of synaptic transmission, is bas- 
ically responsible for all types of onto- 
genetic adaptation processes. However, 
Humphrey (1933) and Harris (1943) 
both concluded after reviewing response 
decrement from Protista to man that 
habituation is found in organisms with 
Presynaptic and synaptic nervous sys- 


tems, and in those with no nervous sys- 
tems at all! 


Attempts to discriminate adaptation 
from habituation have been based on 
several factors: first, the location of 
events within the central nervous sys- 
tem. It is sometimes suggested that after 
excluding peripheral processes such as 
receptor and effector adaptation a “true” 
kind of habituation is reached which is 
distinctively central (Harris, 1943; 
Jenkins; 1951, Thorpe, 1956). Such 
views are ineluctably related to a some- 
what outmoded reflex-circuit view of 
central nervous system organization, 
whereas current neurophysiological re- 
search points consistently towards ex- 
tensive modification of incoming signals 
at every level from the receptor upwards 
(Hernandez-Peon et al., 1956; Livings- 
ton, 1959; Magoun, 1958). 

A second attempt at differentiation 
has been based on time; for example, 
Harris (1943) argued that 
certain stimulus-response integrations, acti- 
vated at intervals long enough to preclude 
loss of ability of either receptor, conductor, or 
effector to respond maximally, will still decre- 
ment to zero level. To such decrement, uncom- 
plicated by interaction with other integrations, 
we apply the term habituation [p. 387]. 


Such arguments have little force to- 
day. Many recent behavioral studies 
which deal with generalized responsive- 
ness to stimuli, i.e., spatiotemporal pat- 
terns of electrical changes throughout 
the brain and widespread autonomic and 
muscular effects, show how the different 
elements of the total response are differ- 
entially habituated in time, some adapt- 
ing rapidly and others more slowly. The 
reasons for this are not yet clear, but 
there are probably extensive and com- 
plex interactions between many of the 
response elements. ee 

These arguments for discriminating 
adaptation and habituation processes are 
basically concerned with learned versus 
nonlearned responses. Thorpe (1956); 
for example, refers to habituation as 4 
simple learning not to respond to stimuli 
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which tend to be without significance in 
the life of the animal. Others, e.g., 
Humphrey (1933), Bradley and Key 
(1958), also regard habituation as nega- 
tive learning. It will be argued in the 
next section that truly adaptive behavior 
is that acquired in the experience of the 
individual and thus reflects a kind of 
learning process, and that nonlearned 
patterns of adaptation (such as sensory 
functions) are inherited evolutionary 
adaptations. No useful purpose can be 
found for distinguishing habituation as 
an independent term from adaptation. 


TERMINOLOGY 


Such a widely used term as adapta- 
tion must not, to quote Quine (1936), 
be allowed to rest upon an uncritical as- 
sumption of mutual understanding. It 
seems important, therefore, to establish 
some kind of classification of adaptation 
processes if only to aid communication 
and avoid the present confusion which 
attaches to the term. 

Speaking from an evolutionary view- 
point, Simpson (1958) and Pittendrigh 
(1958) are both of the opinion that 
adaptation can refer to two broad 
classes, either to the processes or end 
results of processes which bring about 
and maintain an organism-environment 
relationship useful to individual organ- 
isms and populations. 

The essential difference between these 
two groups is that the first are onto- 
genetic processes which occur within the 
organism and are not transmitted ge- 
netically to offspring. Pittendrigh in- 
cludes in this group such processes as 
general homeostasis, habituation, anti- 
body formation, and learning, i.e., prac- 
tically all forms of what he calls “so- 

matic adaptability.” 

He believes that the second group— 
end results of processes—comprises well- 
defined anatomical structures of claw, 
bill, tooth, or limb; behavioral features 
of the organism which serve proximate 


ends such as food seeking, mating, and 
attack; and even some general features 
such as learning capacity. These phylo- 
genetic adaptations evolve through very 
long periods of time until they reach a 
point of stability and little change, 
which also tends to last for appreciable 
periods of time. This way they may out- 
date their original purpose, as we can 
observe in many of man’s adaptations 
which carry over from a more primitive 
and pungent existence. 

There is of course a transitional group 
between these two large groups to in- 
clude adaptation processes of Group 1 
which are in course of becoming Group 2 
adaptations: differential reproduction 
may favor a particular characteristic so 
that it eventually results in a change in 
the genotype. This is a historical process 
which would span many generations. 

This simple classification into onto- 
genetic and phylogenetic adaptations is 
inadequate from a behavioral orienta- 
tion. It is pointless to place antibody 
formation in the same class as learning, 
which to the psychologist introduces 
new and plastic properties of behavior 
such as long-term modification of re- 
sponsiveness, opportunities for making 
varied and alternative responses to reach 
goals and objects, and the acquisitions of 
new goals not obviously related to bio- 
logical needs. 

Those processes such as antibody for- 
mation, receptor functions, etc., which 
are not subject to much modification 45 
a result of experience, or, especially, of 
learning, should not be placed in the 
first group; they belong instead to the 
second group of phylogenetic adapta- 
tions. These are usually studied in his- 
torical perspective—inferred from fossils 
of bygone ages which provide a succes- 
sion of “stills” of life. But some of the™ 
are available for study in living orga“ 
isms. 

For example, comparative studies 


ee 
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point to the incredible variety of photo- 
receptors to be found among animals, 
ranging from diffuse photosensitivity of 
the whole body through to the fully de- 
veloped image forming eye with lens, 
iris, and other complexities (Simpson, 
1953). Each is characteristically useful 
in a particular kind of environment and 
way of life, and together they represent 
multiple solutions of adaptational prob- 
lems. In the experimental context they 
can be studied as phylogenetic, not onto- 
genetic, adaptations. To ask “are these 
visual processes adaptive?” is to ask a 
question of evolutionary biology, not of 
psychology or physiology. 

Some of the multidimensional factors 
which are involved in the analysis of 
adaptation can be summarized as fol- 
lows: 

1. Ontogenetic adaptation processes: 
the new adjustments and responses the 
organism makes as a result of its own 
learning and experience. These changes 
are (a) not necessarily common to all 
members of the population; they may 
remain restricted to individuals. (b) 
They are not genetically transmitted. 
The medium of their occurrence is the 
central nervous system; they may be 
conveyed to other individuals, however, 
through social-cultural media. (c) They 
are of a short-term stability compared 
with evolutionary adaptations; they can 
result in relatively stable and persistent 
habits of the organism, and thus refer 
to a kind of behavioral steady state. 

2. The end results of adaptation proc- 
esses—phylogenetic adaptations: exam- 
ples are limb structure, physiological 
systems, and patterns of behavior such 
as escape, defense, etc. These (a) me- 
diate biological ends—survival and re- 
Production of the population; (b) are 
common to the population; (c) are 
genetically transmitted; and (d) are 
Stabilized patterns of organization which 
often persist indefinitely. 


IMPLICATIONS FOR PSYCHOLOGY 


The term adaptation has two main 
uses in psychology. The more common 
refers to response decrement through 
repeated elicitation. The second use is 
more evaluative (and applied to be- 
havior based on a combination of re- 
sponse increment and decrement). For 
example, when a behavior pattern de- 
velops to some specific environmental 
demand it is judged to be adaptive or 
maladaptive on the basis of a number of 
nonquantifiable concepts. 

From this complex we derive certain 
impressions as to what constitutes well- 
adapted behavior which as yet can 
hardly be formulated into any precise 
experimental program. For in the first 
place they deal with complex and molar 
patterns of behavior resulting from 
“real-life” organism-environment inter- 
action, and in the second place they 
frequently involve the content rather 
than the process of learned behavior. 

The notion of adaptation as response 
decrement is quite different. It is a more 
molecular, more easily isolated be- 
havioral process which can be experi- 
mentally investigated. There is clearly 
a very strong case for avoiding confusion 
by insisting that the neutral term “re- 
sponse decrement” replace the more 
common adaptation and habituation in 
the psychological literature. Moreover, 
it would help establish continuity be- 
tween different experimental approaches 
to response decrement and increment, 
and highlights a point recently made by 
Harlow (1959) that there exists no 
fundamental difference other than com- 
plexity, between sensory adaptation, 
habituation learning, conditioned re- 
sponse learning, and the kinds of learn- 
ing described as reasoning and thinking. 
Looked at in this way, the study of sen- 
sory response decrement, which an some 
areas is so advanced and quantitative, 
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could represent a valuable methodologi- 
cal paradigm. i 
For all the lip service paid to bio- 
logical adaptations—drives, needs, in- 
nate mechanisms, etc.—psychologists in 
recent years have directed little atten- 
tion to instinctive behavior as such. It 
is the ethological group which has be- 
come increasingly concerned with the 
analysis of instincts, and thus with more 
highly integrated sequences of behavior. 
Ethology has far more immediate affini- 
ties with the biological significance of 
behavior than has psychology; but there 
are signs of change. Breland and Bre- 
land (1961) after running into diff- 
culties in conditioning animals com- 
ment: 
After 14 years of continuous conditioning and 
observation of thousands of animals, it is our 
reluctant conclusion that the behavior of any 
species cannot be adequately understood, pre- 
dicted, or controlled without knowledge of its 


instinctive patterns, evolutionary history, and 
ecological niche [p. 684]. 


Perhaps their difficulties center on the 
question of what is an adequate explana- 
tion or understanding of behavior. They 
were struck by the sort of behavior 
their animals demonstrated; most psy- 
chological experimenters ignore the con- 
tent of learning and concentrate on the 
“how,” and in this respect they have 
been hopeful of establishing general 
laws of wide applicability. These are 
two complementary, equally valid ap- 
proaches; the Brelands’ findings are not 
incompatible with either, but have led 
them to the limits of certain general 
laws. We are reminded once more not 
only that drive and learning theories will 
one day be trimmed down to a sensible 
size, but that they will almost certainly 
be better shaped to take account of 
innate phylogenetic adaptation patterns. 

Learning theories will undoubtedly 
also benefit from a closer look at the na- 
ture of the response decrement to the 
unconditioned stimulus which occurs in 


many classical conditioning situations. 
Although it is true that some reflexes 
modify little upon repeated stimulation, 
many alter considerably. Indeed, in the 
case of some responses—the galvanic 
skin response, electroencephalogram 
blocking, electromyograms, and the ab- 
dominal skin reflex, for example—re- 
sponse decrement is an obtrusive factor, 
and hand in hand with this goes diffi- 
culty in achieving or maintaining the 
conditioned response (if conditioned 
it is). 

These factors have led to debates not 
only on what specific criteria should be 
applied to classical conditioning, but on 
the more general problem of the physio- 
logical measurement of drive. The 
galvanic skin response not only rapidly 
diminishes to successive stimuli but re- 
appears if the stimulus situation is 
changed in any way, an erratic pattern 
which is reflected in the difficulty of 
establishing a conditioned response 
which excludes sensitization and pseudo- 
conditioning (Stewart, Stern, & Wino- 
kur, 1961). This raises questions about 
traditional definitions of classical con- 
ditioning, and whether pairing of the 
conditioned stimulus and the uncondi- 
tioned stimulus can or should be given a 
status different from other processes of 
response increment (see Hagbarth & 
Kugelberg, 1958). 

Further, when both the specific re- 
sponse being conditioned and the gen- 
eral pattern of physiological and be- 
havioral activity modify upon repeated 
application of the unconditioned stimu- 
lus, it seems reasonable to infer that 
the organism’s level of drive is also 
changing throughout the experiment: 
These physiological data may be diff- 
cult to incorporate into unitary be- 
havioral concepts of drive, but should 
provide more direct links with the neuro- 
physiological approach to learning and 
conditioning (Malmo, 1958). 
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Thus there are some broad and some 
specific implications for psychology 
arising from a review of the multi- 
dimensional approach to biological 
adaptations. One is obvious: that a 
preoccupation with general behavioral 
laws and theory should not preclude a 
wider sympathy for other less systematic 
approaches in biology; that interdis- 
ciplinary research with psychology’s 
near neighbors—ethology and genetics 
—must be fruitful; that the biological 
significance of behavior, albeit compli- 
cated and unwieldy from an experi- 
mental point of view, must sooner or 
later be tackled experimentally. 

More specifically, studies of response 
decrement show it to be an obtrusive 
factor, especially in classical condition- 
ing. In some response modalities, condi- 
tioning seems only to be one of several 
processes of response facilitation which 
can be offset by opposing processes of 
response decrement. These considera- 
tions should lead to a reanalysis of drive 
and arousal concepts in current learning 
theory. 

To conclude, the pattern of alarm, 
alerting or startle reaction which occurs 
to unexpected and novel stimuli, can be 
used to illustrate in summary form the 
different aspects of adaptation. 

The elements of this phylogenetic 
adaptation pattern can be studied com- 
paratively so that its content and ex- 
pression can be related to the type of 
life the animal leads (or has led) and 
the ecological niche which it occupies. 
Thus one animal will be alerted by 
movement, another by a particular 
shape, and so on. In addition, the modi- 
fications which occur in the overall pat- 
tern to repetitive stimuli as a result of 
learning and experience can be studied 
experimentally as processes of response 
decrement. Undoubtedly some elements 
(e.g., the blink of the startle response, 
the effects of certain “sign stimuli”) 


remain undiminished whereas other as- 
pects of the response pattern are rapidly 
inhibited. These processes of response 
modification are apparently systematic 
and can be expressed lawfully. Lastly, 
the relatively stable state of ontogenetic 
adaptation reached by the organism 
after considerable experience, such that 
the alerting response may not occur at 
all to certain stimuli, can be system- 
atically examined, and proposed criteria 
for assessing whether this behavior is 
adaptive or maladaptive made more 
explicit. 
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NEUROCHEMISTRY AND LEARNING 
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Hypotheses of molar and molecular neurochemical approaches to learn- 
ing are discussed. Molar approach equates learning with chemical 
changes of the synapse whereas molecular approach refers learning to 
changes in molecular structure of nucleic acids. 2 approaches are eval- 
uated relative to various learning phenomena. 


The topic of learning is a central 
problem for psychologists. The major 
learning approaches which have domi- 
nated psychology have been the treat- 
ments within a psychological frame of 
reference exemplified by Thorndike, 
Hull, Guthrie, Skinner, the Gestaltists, 
Tolman, and others. However, in recent 
years a number of other approaches have 
appeared on the scence, viz., mathe- 
matical, analogical, neurological, and 
neurochemical. 

The mathematical (or symbolic) ap- 
proach to learning which has generated 
much research in recent years has been 
of a probability or stochastic nature 
(e.g., Bush & Mosteller, 1955; Estes, 
1950; Overall, 1960; Restle, 1955). 
These mathematical models have served 
as convenient guides and have provided 
valuable insights concerning learning 
phenomena. 

In recent years psychology has been 
permeated by a rash of models in which 
different types of analogies provide in- 
sight into the functioning of the organ- 
ism. The main concern of some of these 
models has not been with learning; 
however, most of these analogies can 
include learning phenomena. These 
analogues include communication mod- 
els (Miller, 1953), servomechanisms 
(Weiner, 1948), computer models 
(Resenblatt, 1958), mechanical models 
(Broadbent, 1957), and others. These 
models, like the mathematical approach, 
have Provided interesting and ingenious 
ideas concerning behavior. 


Both the mathematical and analogical 
approaches are of extraorganismic na- 
ture, i.e., they utilize external aspects to 
apply to the organism. On the other 
hand, the neurological and neurochemi- 
cal approaches are intraorganismic in 
that they deal directly with the internal 
aspects of the organism. 

The neurological approaches to learn- 
ing have been numerous. The most in- 
fluential ones have been those by Lash- 
ley (1929) and by Hebb (1949). The 
cell assemblies and phase sequences of 
the latter have permeated much of psy- 
chological thinking. Recently Pribram 
(1960) has presented an interesting 
theoretical approach in an excellent re- 
view. Other excellent reviews of recent 
research on learning phenomena include 
articles by Morrell (1961), John 
(1961), and Thomas (1962). 

The neurochemical, or biochemical, 
approach to learning is a very recent one 
and appears to offer great promise for 
the future. In this area some very ex- 
citing research results have been ob- 
tained. We would like to review this 
approach, 

The neurochemical approach to be- 
havior may be said to have had its 
inception when Thudichum published 
a monumental treatise (Thudichum, 
1884; cited by Page, 1955). Thudi- 
chum described the first systematic 
attempt to understand the chemical 
mechanisms of nervous tissue. After 
Thudichum’s death in 1901 work on 
neurochemistry all but stopped. Ap- 
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proximately 35 years later, _ Scientists 
again began to study the chemical mech- 
anisms of nerve tissue. 

In recent years this renewed interest 
in neurochemistry has been reflected by 
a number of results which have been per- 
tinent to psychology. For example, Wil- 
liams (1956) has investigated the bio- 
chemical make-up of humans and lower 
animals and has indicated that under- 
lying the differences in behavior of 
organisms are differences in chemical 
activity. An exciting neurochemical 
approach is the attempt to identify 
specific chemicals such as serotonin, 
adrenolution, acetylcholine (ACh), etc., 
as the causative agents for psychotic be- 
havior (Rinkel & Denber, 1958; Rubin, 
1959). 

The neurochemical approach to learn- 
ing is extremely new, having developed 
within the past decade. There have 
been two major treatments of learning 
within this approach, one remaining at a 
rather molar chemical level whereas the 


other is concerned with submolecular 
changes. 


Morar CHEMICAL APPROACH 


The molar chemical approach deals 
with the relationshipof various chemicals 
such as ACh or calcium with the acquisi- 
tion and retention of specific behavior 
indicative of learning. This neurochemi- 
cal approach rests on the general hypoth- 
esis that variations in brain chemistry 
are major determinants of adaptive be- 
havior in animals. The California group 
(Krech, Rosenzweig, and others) are 
the foremost proponents of this ap- 
proach. 

The main thesis of the California 
group is that changes in ACh are related 
to learning and adaptive behavior. They 

have conducted a series of experiments 
combining biochemical analytic tech- 
niques and psychological procedures. 
They have hypothesized that experience 
and training may significantly alter the 


concentration of brain ACh and cholin- 
esterase (ChE; Bennett, Rosenzweig, 
Krech, Karlsson, Dye, & Ohlander, 
1958; Bennett, Krech, Rosenzweig, 
Karlsson, Dye, & Ohlander, 1958; 
Krech, Rosenzweig, Bennett, & Krue- 
ckel, 1954; Krech, Rosenzweig, & 
Bennett, 1956, 1959; Krech, Rosen- 
zweig, Bennett, & Longueil, 1959; 
Rosenzweig, Krech, & Bennett, 1956, 
1960). These individuals have main- 
tained that transmission of nerve im- 
pulses is accomplished by the discharge 
of ACh from presynaptic neurons. In- 
activation of ACh by ChE, as soon as 
the former stimulates the postsynaptic 
neuron, preserves discrete transmission 
of impulses. 

The California group was interested 
in the relationship between ACh and be- 
havior. However, no reliable techniques 
for the measurement of ACh were avail- 
able. Therefore, they decided to use 
amounts of ChE as an index of ACh 
levels because the former was considered 
to be a stable component which is easily 
and reliably measured. This decision 
was a tentative one made with some mis- 
givings. 

The biochemical analytic procedures 
involved decapitating an animal, taking 
samples from desired portions of the 
brain (of rats), weighing the sample, 
and homogenizing the tissue in certain 
chemicals. After homogenizing the tis- 
sue, a number of complex chemical re- 
actions were involved; the end result 
being a statement of the ChE activity 
per milligram of tissue. 7 

The initial study by the group indi- 
cated that rats with strong spatial (right 
or left) preferences in maze running hai 
greater ChE activity than rats wit 
strong visual (light or dark) preference? 
(Krech et al., 1954). The greater Ch 
activity was present in somesthetiC 
visual, and motor areas of the cortet’ 
These results were confirmed in a late" 
experiment in which pentobarbital 5° 
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dium was administered to depress ACh 
metabolism. Animals given the drug 
consistently showed visual preferences 
(Rosenzweig et al., 1956). Further 
results were in agreement with the first 
experiment (Krech et al., 1956). 

A very interesting result was obtained 
when Bennett, Rosenzweig, Krech, 
Karlsson, Dye, and Ohlander (1958) 
studied the ChE activity of two strains 
of rats at various ages. It was found 
that descendants of the Tryon Maze- 
Bright strain showed greater average 
ChE activity than did descendants of 
the Tryon Maze-Dull strain for all 
ages investigated (29-527 days). The 
curves for both strains over days showed 
a rapid rise, reaching a peak at approx- 
imately 75-100 days, and then grad- 
ually decreasing. The two curves are 
remarkably similar to the curve of 
mental growth reported for humans 
(Wechsler, 1944). However, the simi- 
larity may be coincidental. 

Another experiment indicated that 
subcortical ChE activity was negatively 
related to brain weight, but there was 
no relationship between cortical ChE 
and brain weight (Krech, Rosenzweig, 
& Bennett, 1959), 

In a recent summary of their ap- 
Proach, Rosenzweig, Krech, and Bennett 
(1960) have rejected their hypothesis 
that ChE activity furnishes a good index 
of ACh activity. They maintained that 
each is under separate genetic control. 
However, their basic hypothesis that the 
ACh transmission system is intimately 
involved in learning was retained. This 
change was necessitated by results which 
showed that for some strains of rats 
high ChE levels were positively related to 
learning ability but in other strains there 
Was a negative relationship. Further ex- 
periments suggested that the difference 
between ACh and ChE levels was of 
major importance, with greater learning 
ability being related to greater amount 
of ACh (within limits) at the synapse. 


The experimental work conducted by 
Krech and associates lends credibility 
to their hypothesis. However, Tower 
(1958) has raised questions concerning 
the approach. He maintained that even 
though evidence favors ACh as the trans- 
mitter substance, the idea is controver- 
sial; ACh seems to be of less or no im- 
portance to certain systems, including 
much of the sensory functions upon 
which maze behavior is dependent; the 
approach does not differentiate between 
Type I and Type II ChE but only the 
former participates in ACh hydrolysis; 
ChE in the brain is far in excess of re- 
quirements, thus making difficult the 
uncovering of relationships with activ- 
ity; the changes in ChE are so small as 
to be possible errors of sampling and 
analysis. Rosenzweig, Krech, and Ben- 
nett (1960) have countered this last 
criticism by indicating that even though 
the differences they have found are 
small, these have appeared consistently.’ 

Another approach related to the ACh- 
ChE hypotheses is that of Overton 
(1958, 1959a, 1959b). Overton’s ap- 
proach is very simply stated as a calcium 
displacement hypothesis: decrease in 
calcium concentrations at the synapse 
facilitates learning; increases in calcium 
retards learning. According to Overton, 
stimulation produces reverberatory cir- 
cuits which are the basis for immediate 
memory. If the activity of these circuits 
continues as long as 30 seconds, per- 
manent changes may occur at the 
synapse which wili maintain permanent 
memory for the stimulating event. These 
changes at the synapse consist of the 
displacement of calcium by acetic acid, 


1 More recent research by the California 
group has handled more of these criticisms and 
they have reported further outstanding results: 
they found that environmental stimulation 
leads to an increase in cortical weight, in thick- 
ness of cortex, in both Types I and II cn 
and a tendency toward an increase in the ratio 
of glia to neurons in the cortex (Rosenzweig, 


1963). 
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one of the by-products of the breakdown 
of ACh by ChE. Strong stimulation is 
supposed to produce greater concentra- 
tions of acetic acid which increases the 
probability of removing calcium deposits 
that occur naturally at the synapse. 

Overton gives the results of experi- 

mentation which are consistent with his 
calcium displacement hypothesis. He 
fed two groups of rats diets of calcium, 
the diet of one group containing 35 times 
as much calcium as the other. The 
group on low calcium diet performed 
better in learning a maze than did the 
high calcium group. He also found that 
the high calcium diet group contained 
more calcium in brain neural tissue than 
did the low calcium group. Although 
these results tended to be consistent 
with his hypothesis, he suggested that 
the different diets may have had dif- 
ferential effects on sensory threshold and 
discriminative ability rather than on 
memory of the maze. He also suggested 
that altered metabolism of other electro- 
lytes in the system brought about by ab- 
normal calcium diets may have caused 
the performance differences. A third 
alternative he offered was that the cal- 
cium diets may have affected the rats’ 
cage activity, producing differential rates 
of retroactive inhibition. 

The approach of both Overton and the 
California group are remarkably similar. 
Overton merely considers synaptic events 
other than ACh-ChE interaction and 
attempts to develop a more involved 
theoretical schema. His overall ap- 
proach reflects a merger of Hebbian and 
Hullian ideas. 

Two major difficulties are posed for 
proponents of this approach. The cen- 
tral thesis of both Krech and his associ- 
ates and of Overton is that the chemi- 
cals being measured operate at the 
synapse. Their techniques are not re- 
fined enough to conclusively indicate 
that this is so. Thus, the ACh-ChE and 
calcium activity could be more diffuse. 


However, Tower (1958) maintains that 
ACh appears to be concentrated at 
synaptic endings and that the supply 
seems to vary with synaptic activity. 
The second difficulty is one that con- 
fronted the Hebbian system (Hebb, 
1949). Milner (1957) indicated that a 
deficiency of the Hebbian system was 
its emphasis on neural excitation to the 
exclusion of inhibitory phenomena and 
maintained that these latter events could 
not be explained by lack of facilitation 
or by invoking the principle of refrac- 
toriness. Hence, he introduced an active 
mechanism for inhibition involving aD 
interaction between cells. The same 
criticism can be leveled at the molar 
chemical approaches. Both approaches 
appear to be handling inhibitory phe- 
nomena as a lack of facilitation. 


MOLECULAR CHEMICAL APPROACH 


The molecular or specificity neuro- 
chemical approach to the experiential 
code is not new. A number of people 
have expressed the idea that memory 
involves a molecular change in certain 
tissue, for example, Pauling and Weiss 
during the Hixon Symposium (Jeffress, 
1951) and Gerard (1960). In a recent 
book on the nature of chemical bond- 
ing, Pauling (1960) concludes: 
that thinking, both conscious and unconscious: 
and short-term memory involve electromag- 
netic phenomena in the brain, interacting wit? 
the molecular (material) patterns of long-term 


memory, obtained from inheritance or €x- 
perience [p. 570]. 


A few years ago protein molecules 
were considered the most likely candi- 
dates to contain the memory trace. Kat? 
and Halstead (1950) and Halstead 
(1951) favored the hypothesis that 
nucleoproteins were the substances 
which had the ability to act as template’ 
on which replica molecules were forme®: 
At first the neurons of the brain wet? 
supposed to contain random configu!” 
tions of protein. Stimulation of neut? 
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tissue by impulses caused the randomly 
oriented molecules to assume a specific 
configuration. Nucleoproteins were in- 
volved in these reorientations and be- 
came templates, thus providing a mem- 
ory mechanism. They believed that 
these templates are like those of the 
germ cells in representing native endow- 
ment but differ from the latter in aris- 
ing from external stimulation. They 
stated that the ordering of the protein 
templates could take place in various 
components of the cell and its processes, 
including the synapse. However, the re- 
organized protein replicas ultimately 
reside in the neural membrane where 
they participate as “traces.” 

Recently the protein molecule has 
been supplanted by the nucleic acids as 
the potential memory mechanism. In 
the last few years a number of individ- 
uals have independently suggested the 
hypothesis that deoxyribonucleic acid 
(DNA) or ribonucleic acid (RNA) 
function as the molecular mechanism in 
which experiential events are symbolized 
(Dingman & Sporn, 1961; Gaito, 1961; 
Hydén, 1959). 

Before launching into a discussion of 
these hypotheses, some background 
information concerning nucleic acids is 
in order. 

It has been conclusively established 
by virologists and other microbiologists 
that DNA is the genic material (Butler, 
1959; Crick, 1954, 1957; Lederberg, 
1960; Sager & Ryan, 1961; Stanley & 
Valens, 1961; Strauss, 1960). DNA is 
a large double strand molecule wound 
in a helix, with each strand being the 
complement of the other. Each strand 
has a recurring sequence of a phosphate 
followed by deoxyribose sugar. The 
Strands are attached at the sugars by 
means of bases called purines and py- 
timidines. The former are the larger and 
consist of adenine and guanine; the 
latter, of thymine and cytosine. Thus, 
the only apparent difference in DNA 


molecules is in the sequence of these 
bases. If we consider the permutations 
of the four bases, with the restriction 
that adenine must be linked to thymine 
and guanine to cytosine (a restriction 
which occurs in nature), there are only 
four possibilities. Even though there 
are only four possible bases, the strands 
of the molecule are extremely long and 
allow for many possibilities. DNA 
molecules consist of thousands of units 
(Kornberg, 1959). In the human it has 
been suggested that there are approx- 
imately 800,000 DNA molecules, each 
with 40,000 of these units (Stanley & 
Valens, 1961). But let us assume that 
only 1,000 units are present. In this 
case, the number of possibilities for 
a single DNA molecule is 4%°°, as- 
suming independence of sites. Such 
figures are so enormous that they exceed 
the limits of comprehension. 

If we assume that the direction of 
bonding (adenine-thymine is not dif- 
ferent than thymine-adenine, for ex- 
ample) is not of importance, the number 
of possibilities would be 2", where 7 
refers to the number of independent 
units. Again the result is an astronomi- 
cal figure. 

However, we must allow for the pos- 
sibility that the various sites or units are 
not independent of each other. This ap- 
pears to be the case in that Chargaff 
(1958) has indicated that runs of three 
or more purines followed by three or 
more pyrimidines occur frequently. 
Such lack of independence would reduce 
the overall number of possibilities but 
the remaining number of possible events 
would still be tremendously large. 

Thus it is obvious why microbiologists 
believe that these bases may furnish the 
“language” of the genes. Crick (1954) 
has likened the bases to the dots and 
dashes of the Morse code. He states 
that there is enough DNA in a single 
cell of the human body to encode about 
1,000 large textbooks. 
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The other nucleic acid, RNA, is a 
single strand analogue of DNA. DNA 
has deoxyribose sugar and the bases, 
adenine, guanine, thymine, and cytosine; 
RNA has ribose sugar and three of the 
DNA bases but uracil instead of thy- 
mine. 

DNA is found in the nucleus of the 
cell while RNA is located both in the 
nucleus (in the nucleolus) and in the 
cytyoplasm of the cell. In the nerve cell, 
Hydén (1959) has indicated that 90% 
of the RNA is located in the soma of the 
cell; the remaining 10% is found in the 
dendrites. He was unable to find any 
measurable amounts in the axon. The 
RNA in the cytoplasm is mainly con- 
centrated in the Nissl substance, 

DNA and RNA are intimately in- 
volved in cellular functions which are 
of vital concern to the organism. For 
example, in protein synthesis it is ac- 
cepted that DNA sets the process in 
motion by giving “instructions” to the 
RNA molecules which then interact with 
amino acid molecules to form proteins. 
Thus it is assumed that DNA passes the 
genetic code to the RNA molecule via 
the linear sequence of bases. Research 
results support this assumption. 

A number of investigators have shown 
that the base ratio of DNA of a bacterial 
virus, T2, is analogous to that of its 
RNA. This suggests that the similarity 
may go further, even to a detailed cor- 
respondence of the bases in the two. 
That this correspondence is so was indi- 
cated by Hall and Spiegelman (1961). 
They used purified T2-RNA in order 
to provide opportunity for the RNA to 
combine with its DNA complement, un- 
hindered by nonspecific interactions with 

irrelevant RNA. Their data demon- 
strated that molecules are formed con- 
sisting of T2-RNA and its homologous 
DNA. These results are strong evidence 
for the complementarity of the base 
sequences in RNA and DNA. They also 
suggested that the interaction between 


DNA and RNA in which a strand of 
RNA is fashioned in the complementary 
image of DNA may be the means where- 
by the genetic code is transmitted to the 
RNA, which then directs the synthesis 
of proteins. The exact mechanisms 
involved in the synthesis and function- 
ing of messenger RNA were suggested 
by Hurwitz and Furth (1962). 

During the last 13 years, Holger 
Hydeén (1959, 1961) of the University 
of Göteborg has been doing extensive 
research concerning RNA and proteins. 
Using very refined micro-molecular bio- 
chemical techniques, he stimulated an- 
imals and then analyzed the contents of 
nerve cells. His results demonstrated 
that RNA and proteins are produced in 
the nerve cells at a rate which follows 
neuronal activity. He maintained that 
the nerve cell fulfills its function under 
a steady and rapidly changing produc- 
tion of proteins, with the RNA as an 
activator and governing molecule. He 
hypothesized that memory involves a 
change in the sequence of bases in the 
RNA molecule through frequency mod- 
ulation; one or more bases are ex- 
changed with the surrounding cytoplas- 
mic materials. The details of the forma- 
tion of the memory trace are as follows: 

1. Specification of RNA in the Nerve 
Cell: The first series of impulses gener- 
ated in sensory cells or in motor neurons 
pre- or postnatally change the stability 
of one or more of the four bases of the 
RNA molecule at a certain site along 
the molecule. Precisely which changes 
in stability will occur depend on the 
modulated frequency of the first series 
of impulses generated in the nerve cell. 
This effects a change of one against an- 
other base from the surrounding pool. 
The new base at this space is now stable 
under the influence of the modulated fre- 
quency. The new pattern remains an 
constitutes the specification of the RNA 
in the nerve cell. 


2. Formation of RNA Protein under 
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Neural Activity: Since the sequence of 
the bases in the template RNA is now 
changed, new protein formed through 
the mediation of the RNA will also be 
specified. 

3. Dissociation of Protein: Stimula- 
tion causes a rapid dissociation of the 
specified protein and the combination 
of the dissociated products with a com- 
plementary molecule or an energy ac- 
tivation of the specified dissociated pro- 
tein. 

4. Activation of Transmitter Sub- 
stance: Through a rapid combination of 
the dissociated protein with a comple- 
mentary molecule in the third step, an 
activation occurs of the transmitter sub- 
stance, and the postsynaptic structure 
is excited. 

Hyden holds that the neurons are in- 
terconnected in complex tridimensional 
chains. Thus, when neurons in an al- 
ready established memory trace are 
activated, regardless of how many there 
are, or where they are located, the hy- 
pothesis implies the reaction consists of 
the rapid third and fourth steps in the 
scheme. The impulses in the form of 
modulated frequencies activate the 
specified protein and release the trans- 
mitter substance causing the next neuron 
in the chain to be activated or excited. 
The transmitter substance may be ACh 
or any assumed inhibitory substance. 

Hydén apparently agrees with some 
type of Hebbian approach regarding the 
linking of cells to form chains or circuits 
in that he maintained that the reason 
that only neurons of a chain respond is 
because protein, once specified by a 
modulated frequency, responds to the 
same electrical pattern whenever re- 
peated but will not react to other pat- 
terns. 

In Hydén’s system strict localization 
of function in the brain is not required. 
The same neuron could participate as a 
link in many neuronal networks, depend- 
ing on the response of the sequence of 


the bases in the RNA to the modulated 
frequency. Association might grow out 
of successive specification of neurons. 
We may summarize Hydén’s theory 
in less technical terms as follows. A 
nerve cell responds differentially depend- 
ing on whether the pattern of impulses 
it receives is novel or familiar, as well 
as on the pattern itself. No protein will 
have the correct configuration if the 
incoming pattern of impulses is new; 
therefore no dissociation of the protein 
(followed by transmitter substance ex- 
plosion) can occur. The electrical pat- 
tern must first shape a new RNA mol- 
ecule, which in turn shapes a protein 
molecule that can dissociate. The mol- 
ecule fragments and then reacts with a 
complementary molecule, causing the 
triggering of a substance (excitatory or 
inhibitory) across the synapse. If, on 
the other hand, the incoming impulse is 
familiar, protein molecules will already 
be present that are capable of dissociat- 
ing rapidly. Each cell may perpetuate 
within itself a large number of unique 
patterns of RNA and protein. A giant 
RNA molecule could accommodate along 
its length many different sequences 
shaped by different impulse patterns 
that have coursed through the neuron. 
Gaito (1961) suggested that changes 
might occur at the attachment of the 
two strands of DNA, with adenine at 
one locus changing to guanine and the 
associated pyrimidine changing from 
thymine to cytosine. Deletion, addition, 
and rearrangement of bases could be 
other means of changing the code. Thus. 
these changes would provide a basis for 
modification of the genetic potential in 
nerve cells by means of external stimula- 
tion, learning. This would be a somatic 
mutation which would not be trans- 
mitted to the offspring. However, other 
possible biochemical mechanisms were 
suggested which could be involved = 
memory functions, changes either in 
RNA or in the amino acids to subserve 
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memory functions rather than any 
changes in the DNA. The changes in 
DNA, RNA, or amino acid molecules 
could occur either in the nucleus or cyto- 
plasm of the nerve cell. It is also pos- 
sible that some of these modifications 
might occur in the cell processes, even 
at the synapse. 

Dingman and Sporn (1961) hypoth- 
esized that RNA changes were the 
basis for memory. However, they sug- 
gested that the linear sequence of bases 
(primary structure) was only one pos- 
sible means of codifying experiential 
events. They suggested that changes in 
the helical structure (secondary struc- 
ture) and overall configuration (tertiary 
structure) could be the basis for 
memory. 

Having looked at the hypotheses that 
some modification of DNA and/or RNA 
constitutes the basic change in learn- 
ing, let us examine the empirical data 
relative to the nucleic acids. Evidence 
superficially seems to favor the RNA 
molecule rather than DNA, as a possible 
experiential code. 


DNA 


Of the two nucleic acids, DNA is the 
most stable. The average DNA content 
appears to be relatively stable even dur- 
ing marked physiological alterations of 
cells whose nuclear size and protein 
content may vary widely (Alfert, 1957). 
There is general agreement concerning 
the absence of DNA turnover in rapidly 
growing cells and a limited rate of turn- 
over in slowly growing cells or in rest- 
ing bacteria (Sinsheimer, 1960). How- 
ever, Sinsheimer reports a drop of 20% 

in DNA content in certain cells of the 
rat liver upon treatment with cortisone 
and an increase of 11% in the nuclei of 
the adrenal medulla of rats maintained 
at 4° C. for 2 weeks. But he implies that 
these changes may be merely changes in 
solubility rather than in actual content. 

That adenine can be converted to 


guanine in vivo, probably at the nucleo- 
tide level (phosphate, sugar, and a 
single base), is suggested by results in 
which rats utilized adenine containing 
heavy nitrogen (N°), and the nitrogen 
was later found in the adenine and 
guanine of tissue nucleic acids (White 
et al., 1959). They maintained that 
the incorporation of adenine into adult 
rat tissue is mainly in RNA. Incor- 
poration of adenine into DNA is usually 
slow or negligible; however, in partially 
hepatectomized animals undergoing 
rapid liver regeneration, there is exten- 
sive incorporation of adenine into DNA. 
On the basis of these and similar results, 
White et al. suggested that DNA is 
formed to an appreciable extent only 
during active mitosis by a cell. 

White, Handler, Smith, and Stattan 
(1959) also reported that the cytosine 
nucleotide can be obtained from chemi- 
cal reactions involving the uracil nucleo- 
tide. In rats a radioactive isotope 
labeled cytosine compound was a pre- 
cursor of both cytosine and uracil. 

Research with purine and pyrimidine 
analogues, which are basically similar 
to the natural bases but differ in one 
or more side bonds, has provided some 
interesting results. Dunn and Smith 
(1954) have shown that in Escherichia 
coli, 5-bromouracil is incorporated into 
DNA and replaces thymine. They found 
that the addition of either 5-bromouracil 
or 5-iodouracil inhibited the natural 
reactions of Æ. coli. The mean genera- 
tion time of the bacteria was increase 
2 to 4 hours after addition of the base 
analogues; the degree of inhibition in- 
creased with greater amounts of base 
analogues. However, inhibition could 
be reversed by thymine. After inhi- 
bition by either analogue, the pro- 
portion of bacteria giving colonies 0” 
nutrient agar was reduced. When all of 
the thymine was replaced by bromo- 
uracil, only 9% of the phage particles 
yielded viable progeny (Strauss, 1960): 
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The base analogues have been used 
frequently in the DNA of bacterio- 
phages (Sinsheimer, 1960). For ex- 
ample, 5-bromouracil and 2-aminopurine 
are presumed to act to bring about the 
replacement of the adenine-thymine 
pairs by guanine-cytosine ones, and vice 
versa. These agents also bring about a 
reversion of these changes. Freese 
(1961) has indicated that low pH, ethyl 
ethane sulfonate, and other agents, will 
cause transitions from one pair to the 
other. 

Benzer (1961, 1962) has made a 
detailed examination of a small portion 
of the genetic map of bacteriophage T4, 
a portion which controls the ability of 
the phage to grow in E. coli. He indi- 
cated that A-T pairs are held much less 
strongly than are G-C pairs which sug- 
gests that in mutation the A-T pair “hot 
spots” will be more subject to substitu- 
tion. He stated that A-T pairing would 
change to A-G and thereon to C-G. 

In a few organisms, for example, 
tobacco mosaic virus, polio virus, and 
influenza virus, no DNA is present. In 
this case RNA appears to be the heredi- 
tary material. Nitrous acid has been 
used as a mutagen with tobacco mosaic 
virus RNA. This reagent substitutes 
hydroxyl groups (OH) for amine groups 
(NH2). Strauss (1960) has indicated 
that nitrous acid reacts with nucleic 
acids containing adenine, guanine, and 
cytosine and converts them to the cor- 
responding base analogues containing 
hydroxyl compounds, hypoxanthine, 
xanthine, and uracil, These results im- 
ply that the reaction of nitrous acid with 
nucleic acids produces base analogues 
which result in mutation upon the du- 
Plication of genetic material. Tsugita 
and Fraenkel-Conrat (1960) have shown 
that nitrous acid alters the composition 
of RNA of tobacco mosaic virus and that 
the resulting protein of the mutant dif- 
fered from the parent strain with three 
amino acids being replaced by three 


others (proline, aspartic acid, and threo- 
nine by leucine, alanine, and serine). 
These changes have involved the use 
of chemical agents to bring about a 
change of base composition. But there 
is no information to indicate that modifi- 
cation in the DNA molecule can occur 
with moderate stimulation except for 
the White et al. (1959) material, dis- 
cussed above. In fact biochemists are 
skeptical about such changes because of 
the DNA stability. However, the DNA 
molecule should not be eliminated com- 
pletely for several reasons. First, most 
of the DNA which has been investigated 
has been from nonneural tissue. It is 
possible that the activity of DNA in 
nerve cells is different than its activity 
in other cells and may be more con- 
ducive to modification. Mirsky (1953) 
indicated that the metabolic activity of 
protein varies greatly in different nuclei 
of an organism, suggesting that the 
chromosomes of different cells vary in 
their activity. He showed that differen- 
tial activity did occur. With heavy 
nitrogen in glycine the DNA in mouse 
liver chomosomes took up nitrogen three 
times as fast as that in the kidney, even 
though the liver and kidney have equal 
amounts of DNA. Also the cells of a 
digestive gland such as the pancreas are 
far more active when an animal is fed. 
While these cells are active, the DNA 
of their chromosomes takes up 50% 
more labeled nitrogen. Possibly the 
DNA of nerve cells is more active than 
that in most other tissue. 
Furthermore, the stability of the DNA 
molecule does not necessarily preclude 
it operating as a source for memory 
changes. In fact its stability would 
appear to be an argument in its favor. 


The experiential code requires ae 
stability. If memory were maintaine 
too active 


by molecules which were 


metabolically, then it would change 


rapidly causing chaotic behavior. In 


any event, it may be that changes at the 
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base sites do occur. Yet such an inter- 
pretation must be held with caution, 


RNA ; 
Information concerning the possibility 
of RNA playing a role in memory is 
more encouraging at first sight. RNA 
varies from cell to cell and is very active 
metabolically (Ris, 1957). Sinsheimer 
(1960) reported that the amount of 
RNA in the salivary gland of Drosoph- 
ila drops rapidly during early stages 
of differentiation whereas the amount of 
DNA increases, and changes in the over- 
all nucleotide composition of RNA of 
Chlorella cells occur during starvation. 
LeBaron (1959) indicated that other 
cytochemical research provides evidence 
for increased activity of cellular RNA 
and proteins. He concluded that: 
There is certainly ample evidence for the 
active turnover of various lipide, protein and 
nucleic acid structural constituents, and the 
Possibility exists that there is some alteration 
of this turnover on stimulation [p. 897]. 


Hydén (1961) reported that the RNA 
content of the nerve cell ranks with the 
highest of all cells in the body. He 
showed that in man the RNA content 
of the motor nerve cells in the spinal 
cord increases from the third year of life 
to age 40, remains constant to about 60, 
and then declines rapidly thereafter. He 
found that if an animal is deprived of 
stimulation in one of the sensory sys- 
tems, for example, in vision or hearing, 
the neurons in that system do not de- 
velop biochemically. The structure ap- 
peared normal but the nerve cell was 
impoverished in both RNA and proteins, 
Hydèn maintained that individuals with 
certain psychic disorders have smaller 

amounts of RNA and proteins in gan- 
glion cells of the central nervous system 
than do normal individuals (cited by 
Davidson, 1960). Administration of 
malononitrile to these individuals in- 
creased the content of these substances. 
Egyhazi and Hydén (1961) indicated 


that the malononitrile action is due to 
the formation of a dimer of malononi- 
trile, tricyanoamino-propane. They re- 
ported that small amounts of this latter 
compound caused a remarkable increase 
in the amounts of proteins and RNA in 
the cell and modified the base composi- 
tion of the RNA with guanine increas- 
ing by almost 300%. 

Riesen (1958), in discussing his work 
on light deprivation, referred to Bratt- 
gard’s findings that the content of RNA 
and proteins decreases in retinal ganglion 
cells with prolonged light deprivation. 
He thought that RNA and protein were 
so highly susceptible to recent prior stim- 
ulation as to obviate their being consid- 
ered as a mechanism for durable mem- 
ory. Instead he reasoned that they might 
be important for immediate memory. 
Pertinent to this thought is the work of 
Geiger, Yamasoki, and Lyons (1958). 
These individuals stimulated the brain 
cortex of cats and found a change of 
RNA in the stimulated areas, which was 
reversible in minutes, 

Likewise, Morrell (1961) has shown 
gross changes in RNA of nerve cells in 
a cellular learning-like situation. He 
stimulated a portion of the cortex with 
ethyl chloride spray. The homologous 
tissue of the opposite hemisphere showed 
activity during this stimulation, At 
first the activity in the opposite hemi- 
sphere appeared only when the stimu- 
lated cortex was active. After a time, 
the tissue in the nonstimulated area 
showed spontaneous excitation even 
when it was isolated from the stimulated 
hemisphere by cutting its connections: 
Biochemical analysis of the neurons m 
the isolated tissue showed a change in 
the RNA content. Kreps has also 
reported an altered RNA turnover i? 
conditioning (cited by Gerard, 1960): 

The above results have been pertinent 
to sensory stimulation experiments: 
However, several experiments have bee” 
concerned with memory functions. I0 
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direct support for the RNA modification 
hypothesis is provided by the results of 
Cameron and Solyom (1961). They 
found that administration of RNA (but 
not DNA) to aged individuals brought 
about memory improvement. These 
changes involved almost total retention 
in some cases. Later, when the RNA 
was discontinued memory relapses oc- 
curred. However, it is possible that the 
RNA was merely supplying a nutritional 
need and would improve any type of 
performance. This result should be con- 
sidered relative to Hydén’s finding that 
the amount of RNA decreases after the 
age of 60. 

An experiment by Corning and John 
(1961) is also pertinent. Using a clas- 
sical conditioning procedure, pairing 
light with shock, they conditioned a 
number of flatworms and then tran- 
sected them into head and tail sections. 
Previous experimentation had indicated 
that heads will regenerate new tails, 
tails will regenerate new heads, and 
both will retain some “memory” of the 
avoidance situation. Corning and John 
thought that RNA might play a role in 
the transmission of an acquired struc- 
tural configuration from the trained to 
the regenerating tissues. Thus they 
reasoned that if the trained portions 
Were regenerated in the presence of 
RNase (ribonuclease), the enzyme 
would affect the altered RNA structure, 
producing some animals with a naive 
head (regenerated portion) and trained 
tails and others with trained heads and 
naive tails (regenerate). They stated 
that the head region would probably be 
dominant and thus the trained head 
animals should show more retention. 
Their results indicated that heads re- 
generated in RNase retained the mem- 
ory as well as did head and tail sections 
regenerating in pond water. However, 
the tails regenerating in RNase per- 
formed randomly. The authors sug- 
gested that the RNase did not affect 


intact tissue but did interfere with re- 
generating tissue and maintained that 
the results are compatible with the as- 
sumption that RNA is a memory 
mechanism. 

The idea that changes in the linear 
sequence of bases in RNA constitutes 
the experiential code has been offered 
independently by a number of indi- 
viduals. However, Dingman and Sporn 
(1961) suggested that changes in the 
helical structure and overall configura- 
tion, as well as sequence changes, could 
be the basis for memory. They per- 
formed two experiments with radio- 
active 8-azaguanine injections in rats 
which were pertinent to the RNA hy- 
pothesis. 8-azaguanine was used as an 
inhibitor of RNA because this base ana- 
Icgue had been shown to be an inhibitor 
of enzyme synthesis in bacteria. In 
both experiments paper chromatographic 
procedures indicated that the base ana- 
logue had been incorporated into the 
RNA of the brain. In neither experi- 
ment was there any significant differ- 
ence between experimental and control 
animals in average time to run the maze, 
Suggesting that 8-azaguanine had no 
adverse effect on the motor ability of 
the animals. However, in one study the 
experimental animals had a greater aver- 
age number of errors than did the con- 
trols on all 15 trials in the learning of 
a maze. In another experiment con- 
cerned with retention of a maze pattern 
(tested by a single trail after learning a 
maze), experimental animals did not 
differ significantly from control animals 
even though the experimental animals 
had a greater average number of errors 
than did the control animals. There 
were only 8 animals used in each group 
(as compared with 14 and 15 in the 
learning experiment) ; thus it is possible 
that if n had been larger in the retention 
experiment, the results would have indi- 
cated that 8-azaguanine adversely affects 
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both learning and retention of maze pat- 
terns in rats. g 

Based on their results Dingman and 
Sporn maintained that RNA may be 
directly involved in learning but not in 
retention. However, they admitted that 
their results did not necessarily indicate 
that RNA metabolism was intimately 
linked with the formation of memory 
traces in the brain because 8-azagua- 
nine might have interfered with meta- 
bolic processes which affected RNA in- 
directly. 

Most of the above studies in which 
RNA changes were reported do not ex- 
clude the possibility that the basic 
changes were effected in DNA which 
then brought about changes in RNA. 
However, the results of Cameron and 
Solyom and of Corning and John appear 
to argue against the involvement of 
DNA in memory. But both experiments 
appear to suffer from methodological 
defects, The work by Cameron and 
Solyom lacked the use of doubleblind 
or similar techniques to eliminate ex- 
perimenter bias in all experiments. 

The results of Corning and John are 
not as clear-cut as they appear. They 
performed multiple ¢ tests (exact num- 
ber not specified) and reported only the 
significant ones. It is possible that 
some of these were Type I errors in that 
with many groups such as Corning and 
John had, the probability of Type I 
errors is greater than that specified in 
¢ tables. Furthermore, from a qualita- 
tive analysis of their basic table of data 
there is a suggestion that RNase may 
have a residual effect, sensitizing or 
irritating the animals such that they are 
predisposed to make a great number of 
responses to light. In the retention data 
three groups which were in RNase 
(other than tail animals) showed a 
greater number of CRs to light on each 

of 3 days than did corresponding con- 
trol groups in pond water. These results 


would probably be significant if a re- 


peated measurements analysis were at- 
tempted. Because of these considera- 
tions the Corning and John study is 
inconclusive. 

The big problem faced by advocates 
of the RNA hypothesis is that RNA is 
not of a single homogeneous type. There 
are several types of RNA which vary in 
structure and function. One type, mes- 
senger RNA, which receives the basic 
genetic code from a strand of DNA 
(Hall & Spiegelman, 1961; Hurwitz & 
Furth, 1962) would appear to be the 
most important one to serve as an ex- 
periential code. Another type, soluble 
RNA, which functions to carry activated 
amino acids to the ribosomes for protein 
synthesis, and other RNA in the ribo- 
somes might be considered as “junk” for 
information coding purposes. Thus, the 
changes in amounts of RNA, or base 
changes, demonstrated by Hydén and 
others may be relative to junk RNA 
through metabolic activity and be to- 
tally unrelated to information coding. 
The reversible changes in RNA content 
reported by Geiger et al. (1958) and 
by Riesen (1958) appear to support this 
possibility. 

Therefore, even though there is more 
optimism among scientists about the 
RNA hypothesis than the DNA hy- 
pothesis, there is no conclusive (or even 
good suggestive) evidence that the 
former is more appropriate than the lat- 
ter. Furthermore, even though the ex- 
perimental results superficially tend to 
suggest the involvement of RNA in 
memory, its extreme metabolic reac- 
tivity raises some doubt as to how 
stable memory can be handled by an 
overly reactive molecule. On the other 
hand, the extreme reactivity may reflect 
gross aspects, with base changes re- 
sponsible for stable memory less in- 
volved in overall reactivity. But, as has 
been indicated above, Riesen’s sugges” 
tion that RNA subserves an immediate 
memory function may be entirely appro” 
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priate and require that another molec- 
ular mechanism be postulated for main- 
taining permanent memory. 


Conclusions 


If we consider memory as of two 
types, transient and permanent (Hebb, 
1949; Jeffress, 1951), and use DNA and 
RNA as candidates for these functions, 
we have four possibilities as indicated 
in Table 1. However, we should add two 
possibilities which correspond to DNA 
or RNA serving only for permanent 
memory whereas a short-term phe- 
nomenon such as reverberatory circuits 
could suffice for transient memory. We 
should also add two other possibilities 
which involve DNA or RNA function- 
ing for transient memory with rever- 
beratory circuits maintaining permanent 
memory. 

Of the eight possibilities, A and C 
(which use DNA for transient memory) 
seem to be negated because of the rela- 
tively great stability of DNA. The 
Cameron and Solyom (1961) and Corn- 
ing and John (1961) results might sug- 
gest eliminating Possibility D. Evidence 
(see below) suggesting that reverbera- 
tion ceases within an hour would pre- 
clude Possibilities G and H. Intuitively 
the possibilities in which reverberatory 
circuits maintain short-term memory 
until changes can be effected in DNA or 
RNA (E and F) seem most likely, 

Morrell (1961, p. 478) cites data by 
Chow which is consistent with this pos- 
sibility. Chow reported that electrical 


changes in neural tissue were detected 
only during the early stages of condition- 
ing. Morrell uses these results, as well 
as others indicating that memory sur- 
vives convulsions, electroshock, trauma, 
concussion, and deep anesthesia, to im- 
plicate RNA as a possible durable mem- 
ory molecule. Likewise, John (1961, p. 
465) stated that reverberatory circuits 
could be the basis of short-term memory 
but that long-term memory would seem 
to necessitate a structural modification. 

That reverberatory circuits are the 
basis for short-term memory appears 
plausible because of much experimental 
data. For example, experiments have 
shown that individuals who learn some- 
thing and then go to sleep tend to retain 
more than do individuals who do not 
sleep after learning (Crafts, Schneirla, 
Robinson, & Gilbert, 1950). Disruption 
of the “consolidation process” by shock- 
ing animals immediately after a learn- 
ing event (running a maze) is reported 
to interfere with the memory for the 
event (Duncan, 1949). Shocks intro- 
duced less than a minute after the run 
interfered more than did shocks given 
later. Gerard (1953) found similar re- 
sults with hamsters. These results ap- 
pear to indicate that some transient ac- 
tivity is responsible for memory, and if 
other activities intervene there is less 
chance for the permanent changes to 
occur. Likewise, observations with 
elderly individuals support the notion of 
reverberatory circuits. In these people 


TABLE 1 


Some PLAUSIBLE NEUROCHEMICAL POSSIBILITIES IN DEALING WITH TRANSIENT 
AND PERMANENT MEMORY 


Possibilities H 
A B € D E F G aa 
Transient DNA RNA DNA RNA Reverberatory DNA 
Circuits 
Permanent DNA RNA RNA DNA DNA RNA Reverberatory 


Circuits 
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that the brain tissue is less 
oe ae hardening of blood vessels 
and other tissues occurs. If the tissue 
is less plastic, then it would seem that 
the initial reverberatory circuits would 
be less effective in developing neural 
changes. Thus the retention of the 
learning of new materials would be less 
than when the individual were younger. 
This is exactly what one finds with 
elderly persons, with difficulty in the 
learning of new materials and quick 
forgetting of newly learned material. 

Other individuals have also suggested 
a dual process for memory. For exam- 
ple, Dethier and Stellar (1961) main- 
tained that brain-damaged studies with 
man and animal-learning experiments 
involving shock suggest that memory is 
a two-part process: an initial, vul- 
nerable process lasting 15 minutes to an 
hour; and a later, invulnerable process 
providing the permanent basis for 
memory. 

Glickman (1961) suggested essen- 
tially the same, a reverberatory circuit 
hypothesis for consolidation of memory 
traces. He cited work by Burns (1954, 
1958) which is consistent with the hy- 
pothesis. Burns isolated small areas of 
the cortex and noted their responses to 
electrical stimulation. He found that 
single trains of pulses can initiate ac- 
tivity lasting for 30 minutes or more, 
that later electrical stimulation inter- 
feres with this activity, and that the 
activity becomes easier to evoke with 
repetitions of the stimulus. 

The two types of memory would de- 
pend upon different neurological sites 
(Gaito, 1961). The reverberatory cir- 
cuits would depend upon synaptic trans- 
mission and be necessary (but not suffi- 
cient) to effect changes in DNA or RNA 
elsewhere in the cell. The changes in 

the nucleic acids would appear to occur 
in the soma of brain nerve cells in that 
DNA is located in the nucleus and Nissl 
substance, which contains RNA, is more 
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profuse in the cytoplasm of the cell 
body than in either the dendrites or in 
the axons (Hydèn, 1959). 

In conclusion, even though much 
microbiological evidence indicates that 
DNA is the genetic code, microbiologists 
are skeptical about its possibility as the 
experiential code. Yet it has not clearly 
been eliminated. RNA, however, ap- 
pears superficially to be implicated in 
memory events and may be the experi- 
ential code. The experimental results 
are not definitive as yet. RNA may be 
the basic molecule for the experiential 
code or it may be indirectly involved 
through the effect of a more basic mole- 
cule or enzyme. If RNA is the experi- 
ential code, the exact mechanism of 
coding is uncertain. The linear sequence 
of purines and pyrimidines appears tO 
be the most plausible mechanism in that 
these sequences provide the basis for 
genetic coding in DNA. However, the 
suggestions of Dingman and Sporn 
(1961) that the helical structure and 
overall configuration are other possible 
mechanisms should be seriously con- 
sidered. 

Thus, there is definite evidence tO 
indicate that gross changes in DNA and 
RNA can occur during stimulation. 
However, no one has directly detected 4 
change of submolecular structure in 
either DNA or RNA such as discussed 
above; all the evidence for changes are 
of indirect nature. The hypothesizing 
about nucleic acids must remain in the 
realm of speculation. The validity of 
these hypotheses must await further re- 
search by biochemists, psychologists) 


neurophysiologists, and others of relate 
areas, 


OVERALL EVALUATION 


A neurochemical approach to learning 
should attempt to describe the bio” 
chemical events which underlie tP? 


major learning phenomena, viz., mecha 
nisms for short-term memory, long-t€" 
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memory, reactivation, normal forgetting, 
special forgetting (such as in hypnosis, 
repression, amnesia), and overall inte- 
gration. Let us look at the neurochemi- 
cal approaches relative to each of these 
phenomena. 


Short- and Long-Term Memory 


At present there are essentially two 
approaches to the representation of ex- 
perience, i.e, memory. The first stems 
from the Hebbian influence and involves 
the linking together of neural units 
which by their linkage symbolize experi- 
ences. The molar group employ this 
conceptualization. Short-term memory 
is maintained by reverberation through 
various circuits until some synaptic 
change can occur which will then in- 
crease the probability of excitation 
throughout the circuity when later some 
portions are excited. The synaptic 
change for Overton is calcium displace- 
ment. The California group do not con- 
cern themselves with short- and long- 
term memory but the synaptic events 
which they stress are the ACh-ChE ac- 
tivities, 

The second means of representing ex- 
perience is symbolically by a chemical 
code, viz., the linear sequence of bases 
in DNA or RNA as hypothesized by the 
molecular approach. 

Actually both approaches could allow 
for reverberatory circuits to maintain 
transient memory. Differences, however, 
would appear in handling permanent 
memory. The molecular group would 
allow for changes to occur at the synapse 
but these changes would not be the end 
result. They would merely be pre- 
liminary to a change of the code else- 
where in the cell; i.e., they would be 
necessary, but not sufficient, for learn- 
ing. 


Reactivation 


The California group are not con- 
cerned with this or the remaining events 


we will consider. Overton allows the 
synaptic changes (calcium displace- 
ment) to facilitate the reactivation of 
the reverberatory circuits to bring about 
recall. Thus he treats permanent mem- 
ory and reactivation as if they were the 
same. However, it appears that some 
distinction should be made between the 
“trace” and the process which reac- 
tivates the trace (Koffka, 1935). 

Of the molecular group only Hydén 
has attempted to explain a reactivation 
process. A change in one or more bases 
in RNA is effected through frequency 
modulation. The changed RNA will 
synthesize new protein which differs in 
amino acid sequence from protein syn- 
thesized by the previous RNA. When 
later the nerve cells are stimulated by 
the same frequency the protein is acti- 
vated, dissociates, and then the disso- 
ciated products combine with a comple- 
mentary molecule. These events acti- 
vate a transmitter substance (ACh or an 
inhibitory substance) and the post- 
synaptic structure is excited. Thus reac- 
tivation requires that the modified pro- 
tein be excited by nerve impulses which 
have the same frequency as those which 
previously modified the RNA. 


Normal and Special Forgetting 


Only Overton has considered these 
events. Normal forgetting results from 
disruption of reverberatory circuits, 
presumably by increased calcium depo- 
sition. To handle special forgetting, 
viz., repression, he maintained that some 
reverberatory circuits are protected by 
the jamming of other neural activity 
which might disrupt those circuits. 
Jamming represents the repression of 
immediate or recalled memories. How- 
ever, Overton did not indicate the neu- 
rological or biochemical basis for jam- 
ming. 

Integration , 

An attempt to describe overall we 
gration biochemically would appear to 
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i erage of the above 
E TE a that 
i of the approaches have talked of 
integration. However, Overton heads 
somewhat in that direction when he 
describes insight as the combining of 
two or more reverberating circuits into 
one large circuit (a la Hebb). 

This section indicates that the neuro- 
chemical approaches are incomplete. 
This is to be expected inasmuch as these 
approaches are relatively new. The two 
approaches need not be considered as 

antagonistic; they appear to be of com- 
plementary nature in that the descrip- 
tion of each is on a different level. The 
ACh-ChE activity, and possible acetic 
acid etching of calcium compounds, may 
be the important events which occur at 
the synapse and facilitate the modifica- 
tion of DNA and RNA elsewhere in the 
cell, or vice versa. One could even add 
the neurological approach of Hebb 
(1949) with cell assemblies and phase 
sequences and have further comple- 
mentarity. The three approaches de- 
scribe events on different levels of dis- 
course and can be valuable supplements 
of each other. 

Of the various proponents of the neu- 
rochemical approach, the California 
group sticks closest to empirical results 
and thus resorts to little speculation. 
The others use empirical data as a 
springboard for their speculations. Ob- 
viously such speculations may be com- 
pletely wrong in the ultimate analysis; 
however, they provide investigators with 
a number of provocative ideas to guide 
experimental research. 

The specificity of the molecular ap- 
proach is a definite advantage over the 
molar approach in that a description of 

events at submolecular levels may pro- 
vide biological scientists with the 
“atoms” of behavior. Furthermore, be- 
havioral scientists have always been in- 
terested in the relationship between 
heredity and environment. The molec- 


JOHN GAITO AND ALBERT ZAVALA 


ular advocates are attempting to handle 
this relationship at a molecular level. 
Gaito would have the two interacting 
within the same molecule (DNA) or 
through two molecules. In the latter 
case DNA provides the basic code to be 
passed on to messenger RNA, which is 
modified through experiential events. 
This interpretation would also be the 
viewpoint of Dingman and Sporn and 
of Hydén. 

The authors believe that the molecu- 
lar viewpoint will be the most useful 
within the next decade or so in that 
rapid advances in microbiology will 
force behavioral scientists to include the 
nucleic acids in their schema. However, 
the ultimate value of both approaches 
will be determined by experimentation. 
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The major purpose of the review was to examine the validity of the 
assumption that the laws of verbal learning obtained on the college 
student will hold reasonably well for younger populations of Ss. Vari- 
ous areas of verbal learning, such as free learning, paired-associate learn- 
ing, serial learning, transfer, retention, and retroactive-proactive inhibi- 
tion, were reviewed in order to determine the extent to which the general 
findings of these areas, based on experiments in which the college stu- 
dent has served as S, have been investigated and replicated with children 
as Ss. The literature on children’s verbal learning revealed a reasonable 
correspondence in the effective variables reported and in the relation- 
ships that have been identified with adult Ss. 


For the past 2 decades subjects in ex- 
periments on verbal learning have been 
preponderantly the college sophomore, 
with only an occasional experiment re- 
ported in which younger or older sub- 
jects were employed. One obvious 
reason for the heavy use of college stu- 
dents is their ready availability. But a 
second reason may be an implicit as- 
sumption that the laws of verbal learn- 
ing obtained on the college student will 
hold reasonably well for other popula- 
tions. The major purpose of the present 
review is to examine the validity of this 
assumption. This will be done by com- 
paring the results of the available studies 
in verbal learning which have used 
children with the results obtained in ex- 
periments where the college student has 
been used as the subject. 

Comparisons of the above nature may 
be used to answer several questions. 
First, we may ask whether the various 
phenomena of verbal learning derived 
from the college student have in fact 
been investigated with subjects of 
younger ages. Second, if such studies 
have been done, we may ask if the same 
general phenomena emerge with both 
children and college students; for ex- 
ample, is retroactive inhibition produced 
at various age levels? Third, we may 
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ask if the specific laws relating a phe- 
nomenon to various manipulable vari- 
ables are the same for both college stu- 
dents and children; for example, is the 
relationship between amount of retro- 
active inhibition and degree of inter- 
polated learning the same? Obviously 
the ability to answer these questions— 
questions which increase in specificity— 
depends upon the availability of ap- 
propriate data. It will be seen that most 
of the review must necessarily be con- 
fined to answering the first two ques- 
tions, namely, are studies available, and, 
if so, how do the general findings com- 
pare. 

Certain restrictions were placed on 
the studies to be reviewed. 

1. From the studies using the college 
student as subject, only phenomena 
which are judged to be firmly estab- 
lished experimentally will be considered. 
For example, reminiscence cannot be 
consistently produced with college stu- 
dents, and it would be of little value for 
the present review to ask whether or not 
it can be produced with children. 

2. For any given reliable phenomenon 
derived from experiments using college 
students, no attempt will be made to cite 
complete references. Rather, illustrative 
references will be given. 

3. Exclusion of studies of verbal or 
language behavior. 
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4. Exclusion of studies using non- 
be date for this review is 
1940, with a selection being made of 
earlier studies. Reviews are available of 
studies of verbal learning in children 
conducted prior to 1940 (e.g., McGeoch, 
1942; McGeoch & Trion, 1952; Munn, 
1954; Peterson, 1933; Spiker, 1960b). 
The review will follow an arbitrary 
outline which first considers the simpler 
experimental operations in verbal learn- 
ing, followed by the more complex. 
Although not considered systematically, 
several studies concerned with the as- 
sociative or verbal repertoire of the sub- 
ject will be discussed first, followed by 
various methods by which verbal learn- 
ing is studied, specifically, free learning, 
paired-associate learning, serial learning, 
and other learning methods, A second 
general classification pertains to the 
more classical topics of verbal learning 
in which two-stage effects are studied, 
such as transfer, simple or single-list 


retention, and retroactive and Proactive 
inhibition, Finally, 


children learning to 


not reached college age 


ASSOCIATIVE REPERTOIRE 


The learning that is observed in any 
verbal-learning experiment js super- 
imposed upon all language learning that 
has occurred prior to the laboratory ex- 
periment; it is this network of language 
habits that may be termed the sub- 
ject’s verbal or associative repertoire, 
This repertoire may consist of various 
components, such as the subject’s reper- 
toire of words and letters, the associa- 
tions between these words (e.g., Russell 
& Jenkins, 1954) and between letters 
(e.g., Underwood & Schulz, 1960), and 
any learned habits of combinations of 
letters or words, e.g., syntactical-gram- 


matical habits. The free-association 
technique has provided a useful method 
of assessing the associative repertoire 
of adults and children; the older studies 
comparing free-association norms of 
children and adults are reviewed by 
Woodworth and Schlosberg (1954, pp. 
52-56). Recently, Brown and Berko 
(1960) compared the free associations 
of first-, second-, and third-grade chil- 
dren and adults with regard to the tend- 
ency to give responses from the same 
form class as the stimulus word; for all 
ages, nouns and adjectives produced 
more such homogeneous responses than 
other parts of speech and the tendency 
to give these responses was found to be 
a direct function of age. Similarly, 
Ervin (1961) found a significant in- 
crease with age (Grades 1-6) in the pro- 
Portion of responses from the same 
gtammatical class as the stimulus word. 

Letter-association norms, obtained for 
college students (Underwood & Schulz, 
1960, Ch. 9), have been useful in spec- 
ifying extraexperimental sources of 
letter-to-letter interference for two- and 
three-letter units (e.g., Underwood & 
Postman, 1960). In an unpublished 
study, the author obtained single-letter 
responses to a sample of single- and 
double-letter stimuli from high school 
students and from second-grade stu- 
dents. The frequency of single-letter re- 
Sponses to the single- and double-letter 
stimuli for the two groups was highly 
he frequency shown by 
Schulz, product-moment 
approximately .80 and 
60 being obtained for high school and 
second-grade students, respectively, The 
hese correlations implies 
a relatively early learning of letter-lan- 
guage habits that appear to be of some 
consequence to the adult verbal learner. 
Furthermore, the relatively high cor- 
respondence in letter associations for the 
three groups may provide for differential 
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predictions of the learning rates as a 
function of meaningfulness for various 
age groups. For example, it should be 
possible to construct lists of materials 
varying in meaningfulness but equal in 
difficulty at each meaningfulness level 
for the various age groups as inferred 
from the respective letter-association 
norms. Assuming that the strength of 
the letter-language habits is a direct 
function of the age of the subject, then 
an interaction between meaningfulness 
and age may be obtained, variations in 
meaningfulness producing smaller differ- 
ences in learning rates for children than 
for college students. Specifically, this 
prediction follows if it is assumed that 
the letter habits of college students tend 
to produce more interference in the 
learning of low meaningful material and 
more facilitation in the learning of high 
meaningful material than for the child 
whose letter-language habits are not as 
strong. 

These developmental studies of word 
and letter associations may provide the 
verbal-learning experimenter with an 
understanding of the various determi- 
nants of the adult free-association norms 
and an appreciation of the complex re- 
lationships exisiting among verbal units. 

Free LEARNING 

Perhaps the simplest learning situa- 
tion, free learning usually involves the 
single presentation of a series of words 
and the subsequent recall of the words 
in any order; examples of free learning 
are shown by the work of Bousfield (e.g., 
Bousfield, Cohen, & Whitmarsh, 1958) 
and Deese (1961). In this area several 
variables have been isolated and manip- 
ulated with college students; namely, 
interitem associative strength, meaning- 
fulness, intralist similarity, and length 
of list. Aside from the series of studies 
emanating from the Catholic University 
of America which will be considered at 
a later point, only a handful of studies 


involving children remain in this impor- 
tant and active research area. 

The influence of interitem associative 
strength has not been investigated with 
children. The fact that free-association 
norms are not readily available for 
children may account for this neglect; 
fortunately, a forthcoming publication 
(Palermo & Jenkins, 1963) will give 
word associates for many age levels. 
Considering the importance that some 
have placed on interitem associative 
strength (Deese, 1961), an extension of 
this work to children, for whom inter- 
item associations may be weaker, may 
produce results which differ from those 
obtained with college students, 

With regard to meaningfulness, an 
older study by Achilles (1920) using 
children (aged 814-1114) and a 5-min- 
ute study period reported greater recall 
for words (Y=5.22) than for nonsense 
syllables (X=1.86). The basic results 
are in line with those of Postman and 
Phillips (1961) who, with college stu- 
dents, obtained greater recall for adjec- 
tives than for nonsense syllables in a 
free-recall situation. 

The single study of formal intralist 
similarity in free recall with children 
(Horowitz, 1961) was performed with 
high school students. For the first four 
alternate study and free-recall trials the 
high similarity condition was superior to 
the low similarity condition. The near- 
est comparison with college students is 
found in an experiment by Miller (1958) 
in which the free recall of redundant 
strings of letters was studied, the more 
redundant strings, i.e., items with high 
formal intralist similarity, being more 
easily learned. The same relationship 
has been found for meaningful similarity 
(Underwood, Runquist, & Schulz, 1959) 
where the free learning of high and low 
similarity adjectives resulted in better 
recall of the highly similar items. a 

The work af "Murdock (1960), in 
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i he length of list in free learning 
ee tale. has not been studied 
with children; however, there is no 
particular reason to expect the linear 
relation between the number of items 
learned and the total presentation time 
which he found with college students 
will not hold for children. ; 

A widely studied phenomenon in free 
learning is that of associative clustering 
in recall. First studied by Bousfield 
(1953), associative clustering has only 
recently been investigated in children. 
Bousfield, Esterson, and Whitmarsh 

(1958) compared third-grade, fourth- 
grade, and college subjects in the type 
of clustering used (perceptual or con- 
ceptual) and in the degree of clustering, 
by presenting 25 stimulus pictures which 
could be grouped on the basis of five 
colors or five categories such as birds, 
fruits, flowers, nature, and vegetables. 
A direct relationship was obtained be- 
tween the degree of total clustering for 
the color and conceptual categories com- 
bined and the age of the subjects. Hol- 
toyd and Holroyd (1961), using high 
school students, compared the degree of 
clustering and total recall for lists dif- 
fering in the degree of associative fre- 
quency, that is, the frequency with 
which words were elicited by four key 
words or clusters. Both degree of clus- 
tering and the amount of absolute recall 
were directly related to the associative 
frequency of the words represented in 
the four categories, thus replicating the 
findings with college subjects by Bous- 
field, Cohen, and Whitmarsh (1958). 


PAIRED-ÅSSOCIATE LEARNING 


It has been considered both logically 
and theoretically convenient to view 
paired-associate learning as occurring in 
two general stages, the response-learning 
stage and the associative or hook-up 
stage (€g. Underwood, Runquist, & 
Schulz, 1959). Spiker (1960b) has also 
recognized the advantages of the paired- 


associate method which allows the pos- 
sibility of an independent manipulation 
of stimulus and response functions and 
is easily adapted for children; however, 
in general, investigators using children 
have not taken advantage of the ana- 
lytical properties that this method pro- 
vides. 

Backward Associations, One aspect of 
the association studied with college sub- 
jects is the formation of a backward or 
R-S association in addition to the usu- 
ally studied forward or S-R association 
(e.g, Richardson, 1960). Similarly, 
backward associations have been dem- 
onstrated in children (Grades 4 and 6) 
by Palermo (1961) who estimated the 
R-S ‘association to be approximately 
57% as strong as the S-R associations; 
no difference was obtained between R-S 
associations for the two grades. 

Meaningfulness. In spite of recent 
theoretical interest in this variable (e-8+ 
Underwood & Schulz, 1960), there are 
no studies reported in which meaningful- 
ness was specifically manipulated as 4 
task variable with children as subjects- 
If the predifferentiation (PD) procedure 
may be considered as a means by whic 
meaningfulness may be “built-in,” sev 
eral studies using children have been re 
ported which represent the manipulation 
of stimulus meaningfulness. Saltz, Met- 
zen, and Ernstein (1961) gave tw? 
groups of children (aged 6-7 and 10-12) 
15 trials of PD training in which thé 
stimuli, four silhouettes of common ob- 
jects, were associated with stars place! 
randomly on a board. Following P 
training the four PD stimuli and ae 
control stimuli were combined with eigh’ 
single-letter responses to form t 
paired-associate task, Comparisons ° 
the PD and control pairs revealed 
slight inhibitory effect for both 38 
groups, the effect being greater for 
older subjects. 


0. 
Two experiments by Muehl (196° 
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1961) were concerned with the effect of 
visual discrimination pretraining upon 
subsequent paired-associate learning. In 
the first experiment (Muehl, 1960), 
pretraining consisted of a matching task 
in which the subject matched a visually 
presented stimulus with the same item 
appearing on a response card. Three 
groups of kindergarten children differed 
in the type of item that was matched, 
one group matching the four words which 
would form the subsequent learning task, 
one group matching four completely dif- 
ferent words, and one group matching a 
series of four geometrical forms. Fol- 
lowing six matching trials the subjects 
were confronted with four paired asso- 
ciates, the stimuli being the four words 
of the first group and the responses being 
the verbal pronunciation of the words. 
The subjects attempted to anticipate the 
verbal response within the 2-second in- 
terval preceding the presentation of the 
stimulus and the experimenter’s pro- 
nunciation of the word. The group re- 
ceiving the same stimuli for matching 
and learning gave significantly more cor- 
rect anticipations for the 12 learning 
trials than the other two groups, thus in- 
dicating a positive effect for stimulus 
PD. In a later experiment, Muehl 
(1961) found that PD training with 
identical or similar stimuli produced no 
differences in paired-associate learning. 
These experiments with children indi- 
cate that both positive and negative ef- 
fects of PD training may be obtained. 
In this respect the results reported with 
adults are comparable, stimulus PD 
training producing both positive (e.g., 
Gannon & Noble, 1961) and negative 
(e.g., Underwood & Schulz, 1960) ef- 
fects. In view of the contradictions in 
the results for both adults and children, 
further experiments may be necessary 
to resolve these discrepancies (in this re- 
gard see Schulz & Tucker, 1961). 
Research with college subjects has 


shown that manipulation of response 
meaningfulness of paired associates pro- 
duces reliable positive effects in learning 
(Underwood & Schulz, 1960). A sug- 
gestive comparison experiment with chil- 
dren is found in an older study by Stod- 
dard (1929), using high school students, 
who showed almost two times as much 
learning for 25 French-English vocabu- 
lary pairs than for the “turned-over” 
English-French pairs, indicating the 
large effect of response meaningfulness 
on paired-associate learning. It is pos- 
sible that future research will reveal that 
the various measures of meaningfulness 
may not predict learning as well for chil- 
dren as for adults. For example, pro- 
nunciation ratings of nonsense syllables 
may lose their predictive value for the 
learning of nonreading subjects. At any 
rate, manipulations of this variable 
would appear to offer possibilities for a 
fruitful series of comparative adult- 
children studies. 

Intralist Similarity. As in free learn- 
ing, the three types of intralist similar- 
ity, formal (letter duplication), mean- 
ingful (synonyms), and conceptual, have 
all been shown to be influential variables 
in paired-associate learning of adults. 
In spite of the importance of this vari- 
able for the learning of college subjects, 
no studies have been reported which 
directly attack intralist similarity with 
children. An inspection of the pairs 
employed by Postman and Murphy 
(1943) reveals high intrastimulus and 
response similarity being produced by 
synonyms and conceptual similarity 
among the experimental pairs and low 
similarity among the control pairs. 
These pairs were combined in a single 
list and learned by eighth-grade stu- 
dents. The significant difference in speed 
of learning in favor of the control pairs 
could be taken as evidence of the opera- 
tion of intralist similarity of children. 


# ne 
However, this type of “inferential 
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i tion of similarity is only sug- 
are a cannot replace experiments 
in which similarity is expressly manipu- 
lated. It would be valuable, for example, 
to determine if children demonstrate 
similarity effects for the three types of 
similarity. It is possible that for the 
young subject, for whom the meaning 
of words and concepts is not known, 
only manipulations of formal similarity 
will be found to be effective; that is, an 
interaction may be expected to occur be- 
tween type of similarity manipulation 
and age of the subject. 

Associative Connection, Manipulation 
of the strength of the pre-experi- 
mental associative connection has been 
found to be an effective variable 
for adult learning. For example, 
Key (1926) found immediate recall 
scores to be directly related to the 
strength of the associative connection as 
indexed by the Kent-Rosanoff norms. 
Several experiments have been reported 
in which variations in the strength of the 
associative connection have been studied 
with children. In a study by Akutagawa 
and Benoit (1959) three lists of paired 
pictures were constructed, one list of 
familiar associates, a second of arbitrary 
associations, and a third of interfering 
associations; the subjects ranged in age 
from 8 to 13. The mean number of cor- 
rect anticipations varied directly with 
the assumed strength of the S-R associa- 
tion. This study must be viewed with 
caution, however, since the three lists 
were presented to all subjects in the 
same order, easy to difficult, thus pos- 
sibly confounding lists and practice 

effects. 

In a more adequately designed ex- 
periment, Castaneda, Fahel, and Odom 
(1961) obtained word associations from 
a population of children (Grades 4-6). 
Two lists were constructed from the 63 
stimulus adjectives, each of the six pairs 
having the same stimuli but differing in 


the strength with which the responses 
were elicited in the word association test. 
Two new groups of subjects (Grades 5 
and 6) learned either the high or low 
association list to a criterion of one per- 
fect or for a maximum of 15 trials. The 
difference between the learning measure 
for the two groups produced a highly 
significant difference. McCullers (1961) 
speculated that control of the age of the 
subjects possibly would produce dif- 
ferential results, since the children would 
differ in their linguistic and classroom 
experiences. Using the norms of Casta- 
neda et al., McCullers selected 40 
fourth- and 40 sixth-grade subjects and 
assigned them to one of eight conditions, 
two lists of high or low associative 
strength, and four interpair intervals © 
2, 3, 4, or 6 seconds. As with the study 
of Castaneda et al., the two lists pro- 
duced highly significant variation in the 
mean trials to learn, while no significant 
differences between the two grades no! 
the interitem intervals were obtained- 
Distributed Practice. In spite of the 
large number of experiments in whic? 
distributed practice has been manipu- 
lated, only one study was found in which 
children served as subjects. Kirkwoo 
(1926) used a paired-associate pro” 
cedure in which preschool subjects 
learned to pair 20 pictures with blocks 
of various geometrical forms. Masse 
practice consisted of daily training, dis- 
tributed practice of training every othe" 
day; training was continued for bot! 
groups until a criterion of three succes” 
sive correct trials was reached. Subject® 
in the distributed condition reached Le 
criterion five practice periods earlié 
than those in the massed condition arg 
showed higher scores on each trial du“ 
ing learning. 4 
While work reported by Underwor" 
(1961) indicates that distributed ee 
tice is not an important variable in n 
learning of paired associates, rec® 
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studies (e.g., Underwood, Keppel, & 
Schulz, 1962; Underwood & Schulz, 
1961) demonstrate that under certain 
conditions, e.g., an A-B, A-C relation- 
ship, distributed practice in learning 
produces marked facilitation in reten- 
tion. To the extent that a child’s learn- 
ing may be described in terms of an A- 
B, A-C transfer paradigm, this would 
certainly appear to be a fruitful research 
area for the educational psychologist. 


SERIAL LEARNING 


Spiker (1960b) pointed out that 
“whereas the literature on serial verbal 
learning in adults is massive, the re- 
search with children is less impressive 
[p. 404].” While approximately 24 
studies employing the paired-associate 
technique were discovered in preparing 
the present review, less than half as 
many using serial learning was found. 
Spiker speculated that this state of af- 
fairs may derive from the fact that serial 
learning is too difficult for the elemen- 
tary school child and, in addition, that 
the serial task is a relatively unanalytical 
tool, each item functioning both as a 
stimulus and as a response. 

Of the serial experiments conducted 
prior to 1940 using children as subjects, 
most were systematic studies of the 
memory span, determining its size as a 
function of age, modes of presentation, 
and types of material. Although these 
studies report a slow increase from early 
age to ages 10-12 and a slower increase 
up to age 18, little information is offered 
by way of accounting for these changes. 
It would seem that developmental 
studies, per se, are of limited value to 
the student of verbal learning other than 
indicating a lower age limit for potential 
experimental subjects, unless, however, 
developmental relationships are found to 
provide the background for differential 
theoretical predictions. For example, 
the age of the subject may be one way 
in which the strength and complexity of 


the associative network may be manipu- 
lated in a verbal-learning experiment. 

Most investigators agree that children, 
as do adults, produce a bowed serial 
position curve (e.g., Andrews, Shapiro, 
& Cofer, 1954; Barnett, Ellis, & Pryer, 
1960; Cassel, 1957), while others have 
observed changes in the shape of the 
curve as a function of age. Unfortu- 
nately, these latter studies (e.g., Lepley, 
1934; Peixotto, 1942) represent proce- 
dures and results so complicated that it 
is impossible to give any substantial gen- 
eralizations in summary form. In addi- 
tion to these studies, one report was con- 
cerned with the effect of an isolated 
term in serial learning (Tatuno, 1961); 
however, in this case the use of subjects 
of differing ages may have produced a 
confounding variable since elementary, 
junior high, and college students were 
placed in independent experimental 
groups, thus confounding ability level 
and the effect of the experimental 
variable. 

Little work has been accomplished in 
serial learning with children, but this 
may be a reflection of the biases of the 
verbal-learning experimenter to work 
with more analytical learning tasks. 
Recently, an important line of research 
has been pursued by Young (1961, 
1962) with college subjects, in which an 
attempt has been made to determine the 
functional stimulus in serial learning. 
Some degree of the complexity of this 
problem may derive from the fact that 
several choices of stimuli are available to 
the college student, for example, the pre- 
ceding stimulus, the preceding stimuli, 
or the serial position, with some stu- 
dents employing one type of stimulus, 
some another, and some a combination. 
Research with children may help ae 
untangle these complex relationships 
since the child may not have developed 
as strong stimulus-selection biases as the 
college student. If this is the case, then 
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a specification of the stimulus in serial 
learning may be more easily made with 
children. In any case, to the extent that 
the work concerned with determining 
stimulus-response functions in serial 
learning is successful, the use of the 
serial task in verbal learning may again 
be seen. 
OTHER LEARNING METHODS 


Syntactical Learning. Research in this 
area has taken two forms, namely, the 
study of the effect of syntax on the 
learning of nonsense sentences (Epstein, 
1961) and the study of the learning and 
retention of connected discourse (e.g., 
Slamecka, 1959). The work on con- 
nected discourse has its counterpart with 
children in the many early studies on 
the Ballard-Williams reminiscence ef- 
fect; however, these experiments are of 
little value here since interest was not 
directed toward the study of the learn- 
ing of connected discourse. Early work 
by Foster (1928) demonstrated the ef- 
fect of repetitions on the mean number 
of words from a story that a child can 
recall, while Stroud and Maul (1933) 
found a direct relationship between the 
amount of poetry recalled and the age of 
the subjects. Although this type of 
learning would appear to have much in 
common with the learning in the class- 
room, there have been no systematic ex- 
periments with children that have been 
concerned specifically with syntactical 
learning. That such experiments may 
be reasonably conducted with children is 
indicated by the work of Berko (1958) 
who has demonstrated that strong and 
consistent syntactical habits have al- 
ready been acquired by the preschooler. 

Verbal Concept Learning. Although 
not exclusively in the area of verbal 
learning, mention will be made of one 
study which represents an attempt to 
extend the findings of adult concept 
learning to that of children. The work 
of Underwood and Richardson (1956) 


was repeated by Runquist and Hutt 
(1961) in an attempt to determine the 
effect of grade level (9-12) on the acqui- 
sition of high dominant concepts. The 
task was to discover the four concepts 
which would describe 16 stimuli, repre- 
sented visually as pictures or verbally 
as words; with the 16 stimuli there were 
four instances of each concept. Two sets 
of pictorial stimuli were used, one set 
which emphasized the dominant concept, 
the other which minimized it. The mean 
total correct concept naming in 15 trials 
was found to increase as a function of 
grade for all three sets of stimuli, with 
an indication of an interaction between 
the sets and grade, the difference be- 
tween the types of stimuli being greater 
for Grade 9 than Grade 12. 

Verbal Discrimination. Several studies 
utilizing college subjects have been 
reported (e.g., McClelland, 1942; Run- 
quist & Freeman, 1960) in which a ver- 
bal-discrimination procedure was em- 
ployed; no studies were found in which 
children were used. 

TRANSFER 

Learning-to-Learn. The mere learning 
of a series of paired-associate or serial 
lists by college subjects has been shown 
to produce increased performance on the 
learning of subsequent and unrelated 
lists (e.g., Greenberg & Underwood, 
1950; Meyer & Miles, 1953). Gates and 
Taylor (1925) testing children investi- 
gated the effect of training in digit 
memory span; this training produced a 
gain of 2.07 digits for the experimental 
subjects following 78 days of practice in 
memorizing digit series of various 
lengths, while the control subjects pro- 
duced a gain of .73 digit. A study by 
Drees (1941) involved _ sixth-grad 

h xth-grade 
children in a paired-associate task for 
which names of trains served as stimuli 
and two-digit numbers as responses. The 
experimental group, receiving three 
months of practice associating other 
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objects with different two-number com- 
binations, improved in subsequent 
paired-associate learning; however, this 
must not be taken as a pure case of 
learning-to-learn transfer since the pos- 
sibility existed that specific number com- 
binations were repeated during the train- 
ing period. Using a different technique, 
Spiker (1960a) demonstrated that train- 
ing fourth-grade students in the use of 
mnemonic devices on one list transferred 
to the learning of a second list. In spite 
of the fact that these studies demon- 
strate in a tangential fashion nonspecific 
improvement in learning, it is evident 
that the proper comparison experiments 
have not been reported so that the de- 
gree and extent of learning-to-learn with 
children has yet to be estimated. 

Warm Up. As another type of non- 
specific transfer, warm up has been dem- 
onstrated in learning with adults (e.g., 
Thune, 1950, 1951), but there are no 
reported studies with children. In an un- 
published study under the direction of 
the author, Richard Piagari manipulated 
degree of warm up on a serial task with 
seventh- and eighth-grade children. Fol- 
lowing four practice trials, the subjects 
received either 0, 7, or 10 trials on serial 
List A. On the second day these groups 
received, respectively, 10, 3, and 0 trials 
on List A, followed immediately by 
List B which was learned to one perfect 
recitation. Since all three groups were 
equated on the degree of learning-to- 
learn, that is, 10 trials on List A and 4 
practice trials, any difference observed 
in learning List B would be a function 
of warm up. In terms of learning on 
List B an inverse relationship was ob- 
tained between the number of warm-up 
trials (List A learning On Day 2) and 
trials to reach criterion on List B. These 
results directly parallel those reported 
by Thune (1950) for adult paired-asso- 
ciate learning. 

Specific Transfer. Although transfer 


designs allow manipulation of various 
similarity relations between stimuli and 
responses across lists, it has been found 
useful to consider key paradigms which 
define the limits of the similarity rela- 
tions. For example, Twedt and Under- 
wood (1959) studied four transfer para- 
digms, A-B, C-D; A-B, C-B; A-B, A-C; 
and A-B, A-Br, the first letters referring 
to the stimulus terms, the second letter 
the response terms, and the relationships 
between stimuli and responses of the 
two lists referring to the similarity ob- 
taining. The last paradigm, A-Br (omit- 
ting the first-list designation), indicates 
a second list in which the stimuli and 
responses of the first list are utilized but 
are repaired in the second list. A recent 
study (Jung, 1962) would seem to indi- 
cate that the amount and direction of 
transfer in the C-B paradigm is depend- 
ent upon the meaningfulness of the re- 
sponse term; that is, with low levels of 
meaningfulness positive transfer may be 
produced while with high levels of mean- 
ingfulness negative transfer is produced. 
Jung interpreted this interaction in terms 
of the transfer of response learning from 
the first to the second list, low levels of 
response meaningfulness producing suffi- 
cient positive transfer through the trans- 
fer of response learning to overcome the 
negative effect produced by interference 
from the backward association (Twedt 
& Underwood, 1959). Presumably the 
same interaction will be obtained be- 
tween the amount of transfer and mean- 
ingfulness for the A-Br paradigm, so 
that a statement of the expected ordering 
of the four paradigms must consider the 
meaningfulness of the response term in 
the C-B and A-Br paradigms. For the 
Twedt-Underwood study, if the C-D 
condition is taken as the baseline for 
zero transfer (i.e., control for nonspe- 
cific transfer), increasing negative trans- 
fer was obtained for the C-B, A-C, and 
A-Br conditions, respectively. These 
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findings are in line with Jung’s predic- 
tions since paired adjectives, which ne- 
cessitate minimal amounts of response 
learning, were used as learning materials. 
Although several investigators have at- 
tempted to isolate various factors to 
account for these transfer results (e.g., 
Barnes, 1960; Keppel & Underwood, 
1962), an attempt will be made here only 
to determine if these general relation- 
ships have been obtained with children. 

Matousek (1939) compared the retro- 

active inhibition (RI) produced by the 
C-D, C-B, and A-C conditions with 
fifth- and sixth-grade subjects. The ma- 
terials consisted of names of men and 
dates, such as Columbus—1503, familiar 
combinations being eliminated. The re- 
sponse terms, although unfamiliar num- 
ber combinations, should probably be 
considered relatively meaningful. All 
groups spent 7 minutes studying the list 
of pairs, followed either by a 2-minute 
recall test or a recognition test in which 
the name was presented with five al- 
ternative dates. The second list was pre- 
sented for 7 minutes, followed by a 
2-minute test; finally, recall was taken 
for the first list. For both recall and 
recognition, the A-C group showed the 
greatest RI, followed by the C-B, and 
finally, the C-D group. The percentage 
RI as measured by the recall method was 
17.7%, 8.7%, and 3.4% for the three 
groups, respectively. To the extent that 
RI and transfer are related, these results 
directly replicate with children, the find- 
ings of Twedt and Underwood with 
adults. 

Other investigators have studied one 
or two of the four paradigms with chil- 
dren. Gladis (1960), using paired one- 
syllable nouns in a C-B paradigm, found 
negative transfer for third- and seventh- 
grade children, but zero transfer for 
fifth-grade subjects. Mean differences 
between experimental and control groups 
in the mean trials to reach criterion were 


—2.2, .2, and —1.3 for the third, fifth, 
and seventh grades, respectively. Spiker 
(1960a) reported a study in which A-C 
and C-D pairs were combined in a single 
mixed-list design. Two groups of fourth- 
grade subjects received either 6 or 15 
trials on the first list, followed by a single 
transfer list. No difference was obtained 
between the A-C and C-D pairs over the 
12 second-list trials for the low degree 
of first-list learning, while significant 
negative transfer was observed for the 
high degree of first-list learning. Finally, 
Iwahara and Sugimura (1960) have re- 
ported the only study in which the A-Br 
condition was employed with children 
(aged 15-19); four degrees of first-list 
learning of paired two-syllable nouns 
were employed. Negative transfer was 
obtained for all but two of the compari- 
sons determined for the several response 
measures, the two deviant comparisons 
producing only slight positive transfer. 

As yet there have been no studies with 
children in which all four paradigms 
were represented. Even though three 
paradigms were studied by Matousek 
(1939), a true paired-associate technique 
was not employed; the newest paradigm, 
A-Br, has been studied only once with 
children and there is a question as to 
whether the estimate from first-list learn- 
ing of nonspecific transfer is the appro- 
priate control comparison to be made. 
The simple replication of the Twedt- 
Underwood study with children of vari- 
ous ages would be of interest, since it is 
possible that factors that have been iso- 
lated for adult learning may not be as 
important for children learning or may 
be of differential influence. For example, 
response learning may be a more impor- 
tant factor for children than for adults 
since the former have not had as much 
experience with the language from which 
the learning materials have been derived. 
If this is the case, systematic studies with 
these key paradigms may prove to be 
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quite valuable for the study of verbal 
learning. 

Interlist Similarity. No studies have 
been reported in which either stimulus or 
response interlist similarity has been 
manipulated with children. 

Mediation. Two studies using children 
have been reported in which an attempt 
was made to extend the findings of Bu- 
gelski and Scharlock (1952) with adult 
subjects. Norcross and Spiker (1958) 
studied positive and negative mediation 
with a mixed-list design in which each 
subject served in the three mediation 
conditions. All subjects received a suc- 
cession of three lists, List 1 consisting of 
six pairs of pictures, with the construc- 
tion of Lists 2 and 3 being dictated by 
the experimental conditions. That is, the 
relationships obtained across the three 
lists allow three mediation paradigms to 
be represented in List 3. Specifically, if 
the stimuli and responses of the three 
lists are expressed by appropriate letters, 
the construction of List 3 produced two 
pairs which formed a positive mediation 
paradigm (A-B, B-C, A-C), two pairs a 
negative paradigm (A-B, B-D, A-C), 
and two control pairs (A-B, D-C, A-C). 
Thirty kindergarten children received 
the three lists. Since these children had 
limited reading ability, each list con- 
sisted of pairs of common pictures for 
which the subject anticipated the name 
of the second picture when presented 
with the first. In the first experiment, 
significant positive mediation was ob- 
tained, while the control and negative 
mediation conditions did not differ sig- 
nificantly. In the second experiment 
only the control and negative conditions 
were studied, with three pairs repre- 
senting each condition instead of two. 
Twenty-four first-grade subjects and a 
faster rate of presentation were em- 
Ployed. These procedures produced sig- 
nificant negative transfer, allowing the 
conclusion that both negative and posi- 


tive mediation may be demonstrated in 
relatively young children. 

Berkson and Cantor (1960) studied 
the positive mediation paradigm with 
normal and retarded elementary school 
children (aged 9-12) to provide more 
tests of mediation along the age dimen- 
sion, ranging from kindergarten-first- 
grade subjects (Norcross & Spiker, 
1958) to college subjects (Bugelski & 
Scharlock, 1952). A design was em- 
ployed in which one half of the subjects 
received the mediation condition (A-B, 
B-C, A-C), the other half of the subjects 
received the control condition (X-B, 
B-C, A-C), where the As were odd num- 
bers, the Xs even numbers, the Bs pic- 
tures of common objects, and the Cs 
various colors. Two comments on design 
may be made: (a) the stimuli, odd num- 
bers for the A stimuli and even numbers 
for the X stimuli, may represent rela- 
tively similar stimulus sets and thus not 
provide a truly neutral control condi- 
tion; (b) the relationship between Lists 
1 and 3 for the experimental group forms 
an A-B, A-C relation while for the con- 
trol group an A-B, C-D relation, thus 
allowing simple transfer effects between 
Lists 1 and 3 (negative) to operate 
against the expected mediational effect 
(positive). However, since significant 
positive transfer was obtained, it is evi- 
dent that any negative effect produced 
by the second factor was not sufficient 
to cancel the positive influence of the 
mediation relation. In an experiment 
with college subjects, McGehee and 
Schulz (1961) studied positive and nega- 
tive mediation, using mediational links 
inferred from word association norms 
rather than specific pair learning. In this 
case, the facilitating effect of the positive 
mediation paradigm was only strong 
enough to overcome the negative transfer 
effect, the experimental and control 
groups producing the same amount of 
transfer. 
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The fact that mediation can occur 

with relatively young children may pro- 
vide a useful source of theoretical predic- 
tions. One obvious question that may be 
asked is whether subjects differing in 
age differ in the strength of, or in the 
use of the implied mediational chains. 
In order to answer this question com- 
parisons between the percentage of sub- 
jects demonstrating mediation may be 
made to determine the number of sub- 
jects using mediational chains for vari- 
ous age groups. Another question refers 
to the theoretical expectation that media- 
tional chains, such as A-B-C, should be 
expected to take a longer time to occur 
than a direct chain of A-C. It follows 
that variations in the anticipation times 
for List 3 should affect the occurrence of 
mediation, longer times allowing media- 
tion to occur, shorter times blocking or 
even inhibiting the runoff of these 
chains. However, if the adult subject 
has developed strong mediational proc- 
esses, then it may be difficult to produce 
this latency effect. On the other hand, 
if children and adults differ in the 
Strengths or use of mediational chains 
and if the strength determines the speed 
with which these chains are runoff, then 
an interaction may be expected in which 
adults are less affected by variations in 
anticipation times on List 3 than are 
children. 

Degree of First-List Learning. Two 
studies previously discussed, Spiker 
(1960a) and Iwahara and Sugimura 
(1960), varied the degree of first-list 
learning in transfer paradigms. Spiker 
found more negative transfer in an A-C 
paradigm with greater first-list learning, 
while Iwahara and Sugimura reported a 
curvilinear relationship between amount 

of negative transfer and degree of first- 
list learning for the A-Br paradigm. The 
results of Iwahara and Sugimura do not 
agree with those recently obtained by 
Postman (1962) who found an increas- 


ing amount of negative transfer as de- 
gree of first-list learning is increased. 
However, since Iwahara and Sugimura 
did not employ a C-D control, a direct 
comparison between the two studies is 
not warranted. In addition, differences 
in the number of pairs (4 and 10) may 
have affected the amount of interference 
generated by the A-Br paradigms for the 
two experiments. 

Summary. A relatively large number 
of studies have involved transfer para- 
digms with children as subjects; all four 
key paradigms and various mediational 
designs are represented, producing phe- 
nomena which have been observed with 
adult subjects. Investigations of learn- 
ing-to-learn, warm up, and interlist simi- 
larity in children have not been reported. 
Future work with children should per- 
haps be directed towards the goal of fur- 
ther isolation of factors thought to be 
operating in these transfer and mediation 
designs. Adult subjects may represent 
learning in which several factors are op- 
erating concomitantly, so that the study 
of children may allow for a more “pure” 
case of whatever factors are theoretically 
assumed to be occurring in adults. 


SIMPLE RETENTION 


The phenomenon of simple retention 
refers to the operations of taking recall 
at various intervals following the learn- 
ing of a single task. Research in this 
area has been limited by various meth- 
odological problems. For example, Un- 
derwood (1957) demonstrated that the 
use of the counterbalanced design in the 
study of retention introduces the prob- 
lem of proaction from previously ac- 
quired lists interfering at recall. Simi- 
larly, problems of degree of learning 
(Underwood, 1954b) and of intra- and 
interlist similarity relations (Under- 
wood, 1954a) have posed experimental 
and measurement problems for research- 
ers in this area. Experimental work with 
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children has not been affected by these 
problems for the simple reason that, in 
general, independent groups have been 
used and few variables have been ma- 
nipulated. For the subject naive to 
verbal-learning experiments, any forget- 
ting observed has been thought to be due 
largely to interference from well-learned 
language habits. For example, recent 
statements of interference theory (e.g., 
Postman, 1961; Underwood & Postman, 
1960) have leaned heavily on the pre- 
mise that much of verbal learning in- 
volves the unlearning of previously 
learned, stronger letter- or word-lan- 
guage habits. Viewed in this way, the 
difficulty in learning a low meaningful 
list of nonsense syllables may be thought 
of as requiring the temporary modifica- 
tion of the verbal repertoire the subject 
brings with him to the laboratory, and 
the forgetting observed over 24 hours 
being due, in part, to the regeneration or 
spontaneous recovery of these stronger 
language habits which interfere with re- 
call. If it is assumed that the child has a 
less complicated and less practiced asso- 
ciative network, then less forgetting may 
be expected for children than for college 
students. Unfortunately, no available 
data allow a test of this expectation. 
The evidence presented below repre- 
sents, with one exception, retention 
values for children roughly approximat- 
ing the 80%-90% obtained from college 
subjects (e.g., Underwood & Keppel, 
1962), The retention of children in 
these studies might in fact be expected 
to be less than the values obtained for 
College subjects. The reason for this is 
that the low degrees of learning em- 
Ployed in the children studies may result 
in associations which are more suscepti- 
ble to interference than those produced 
by the higher levels of learning em- 
ployed in the studies with college sub- 
jects, 


The data on the retention of children 


noted above come from a series of ex- 
periments from the Catholic University 
of America which allows the determina- 
tion of the retention of children naive 
to verbal-learning experiments. Lahey 
(1937) reported for control groups 
studying a list of 25 verbs for 5 minutes, 
24-hour retention values ranging from 
72%-82%. Houlahan (1937), using the 
same materials as Lahey, reported con- 
trol-group retention of 96% after 17 
minutes and 85% after 24 hours. Mc- 
Dade (1943) had the subjects study a 
list of 20 objects for 30 seconds, with 
retention for the names of the objects 
being taken at various intervals. For 
younger subjects (MA=9.5), 93% re- 
tention was observed and for older sub- 
jects (MA=12.5), 89% retention was 
obtained after 24 hours. Finally, Sharpe 
(1952) studied the retention of the 
meaning of a prose passage by adminis- 
tering a series of questions to various 
groups; 87% retention was obtained 
after 24 hours. One contradiction to 
these data is a study by Otto (1961) in 
which five geometrical form-nonsense 
syllable pairs were learned by children 
from several grade levels (Grades 2, 
4, and 6). Retention measures taken 24 
hours after the attainment of one per- 
fect recitation revealed approximately 
63% to be retained. Clearly, in view of 
the wide differences in the degree and 
methods of learning and in the materials 
used represented in the above experi- 
ments, systematic studies in which de- 
gree of learning is held constant across 
age levels will be necessary to determine 
if the expectation of less forgetting by 
children is to be adequately tested. 
Two studies with children have in- 
volved the manipulation of a subject 
variable in retention. Gregory and 
Bunch (1959) compared slow and fast 
subjects selected from a population of 
elementary school children. Fast sub- 
jects were drawn from the upper quartile 
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of the distribution, slow subjects from 
the lower quartile. Each group of 40 
subjects was subdivided into two smaller 
groups, one being tested immediately 
after learning, the other 24 hours after 
learning. The pairs consisted of 10 geo- 
metrical drawings paired with the num- 
bers 1-10, a response discovery tech- 
nique being used; learning was taken to 
the criterion of one perfect recitation. 
The fast learners tended to recall more 
numbers after 24 hours than the slow 
learners, but the difference was not sig- 
nificant. In addition, since degree of 
learning for the two ability groups is not 
equated in this comparison, it is not 
clear to what extent ability level pro- 
duced differences in retention (see Un- 
derwood, 1954b). Specifically, the mean 
errors after 24 hours for the fast and 
slow groups were 2.8 and 4.1, respec- 
tively, while the two immediate reten- 
tion groups produced mean error scores 
of 9 and 2.0, respectively. Loss scores, 
obtained by subtracting the respective 
0- and 24-hour groups, produces losses 
of 1.9 and 2.1 for fast and slow learners, 
indicating the absence of an effect of 
ability level on retention. This conclu- 
sion 1s supported by the experi 
oe eet who fama tillers 
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Summary. Systematic studies of re- 
tention using children as subjects have 
not been conducted. For the evidence 
that is available, it would appear that 
Underwood’s (1957) estimate of 10%- 
20% forgetting over 24 hours is a Tea- 
sonable one for nonpracticed children, a 
finding which should be quite heartening 
to the teacher who, in reading introduc- 
tory psychology tests, has been told to 
expect 60%-70% forgetting overnight. 
Aside from the practical implications 
that research on retention has for educa- 
tion, the study of retention in children 
would appear to offer the verbal-learn- 


ing theorist potential tests of current 
interference theory. 


RETROACTIVE AND PROACTIVE 
INHIBITION 


Although retroactive inhibition (RI) 
has received a great deal of attention 
since 1900, little systematic research has 
been done with children. For example, a 
recent review by Slamecka and Ceraso 
(1960) of RI and PI studies since 1940, 
does not mention specifically any studies 
in which children were used as subjects. 
Whitely (1927) was the first to investi- 
gate RI in children, finding more RI 
with high school and grammar school 
subjects than with college students; 
however, since the experimental condi- 
tions were not the same for all groups, 
these comparisons must be viewed with 
caution. Several investigators have re- 
ported an inverse relation between de- 
gree of RI and age and IQ (e.g., Houla- 
han, 1937; Lahey, 1937; Matousek, 
1939). Since degree of RI depends upon 
both degrees of original learning (OL) 
and interpolated learning (IL), and 
since learning rate appears to be a direct 
function of age or IQ, these studies pro- 
vide little by way of explaining the ob- 
tained inverse relationship between age 
and IQ and RI. An experiment in which 
degree of learning is held constant for 
all age groups has not been accom- 
plished. Therefore, the relationship be- 
tween RI and age cannot be interpreted 
to mean that younger subjects are more 
susceptible to interference until the de- 
gree of learning problem has been solved. 

The studies that have manipulated 
task variables in an RI design have pro- 
duced results which corroborate those ob- 
tained with adult subjects. Matousek 
(1939) demonstrated that the percent- 
age RI depends upon the similarity rela- 
tionships between the two experimental 
lists. Lahey (1937) found percentage 
RI to increase as the IL study time in- 
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creased from 4 to 14 minutes. Houlahan 
(1937) varied the temporal position of 
the interpolated list within a constant 
OL-recall interval of 17 minutes, finding 
maximum RI when the interpolated list 
was presented immediately before recall 
of the first list; this effect replicated the 
findings with college subjects by Mc- 
Geoch (1933a, 1933b) who found maxi- 
mum RI when IL occurred immediately 
before recall of the first list. The results 
of Deese and Marder (1957) who, with 
high school students, found the number 
of between-list errors to increase from a 
4-minute interval to a 48-hour interval 
is consistent with the notion of differen- 
tiation (Underwood, 1945) in which the 
subject’s discrimination between OL and 
IL is predicted to decrease as the IL-RL 
interval is increased. Finally, Cassel 
(1957) found no difference in the 
amounts of RI between aments and 
children matched in MA. 


FINAL COMMENTS 


A review of the available literature on 
children’s verbal learning has revealed a 
surprising correspondence in the effective 
variables reported and the relationships 
that have been identified with adult sub- 
jects. Although suggestions for further 
study have been indicated at various 
points in this review, a number of com- 
ments with regard to the future use of 
children in verbal-learning experiments 
Seems appropriate at this time. 

First, the developmental study, per se, 
is probably of little value to the verbal- 
learning theorist unless differential re- 
sults are to be expected by theory. This 
is not to say that all research involving 
children’s verbal learning must be di- 
rectly tied to theory, but that the mere 
replication of adult studies may offer 
little to theory directly. To know that 
the memory span increases with chrono- 
logical age is certainly of interest to the 
educational and developmental psycholo- 


gist, but unless this relationship can be 
translated into extant theories and pro- 
duce predictions that will provide rea- 
sonable tests of these theories, the find- 
ing is of no immediate use to the verbal- 
learning theorist. Deese (1952) has said 
this another way when referring to re- 
search of the relative learning abilities 
of children, 

certainly there is very little novel or new 
information which comes out of this research. 
Perhaps the next few years may see a revival 
in interest in this field, with an emphasis upon 
the causes of individual differences with age 
rather than the simple measurement of such 
individual differences [p. 297]. 

Second, research with children may 
provide information with regard to new 
or raw learning, that is, learning which 
may be considered to have occurred for 
the first time, rather than being based 
on previously Jearned mediators or asso- 
ciations. For example, many adult sub- 
jects, in describing the learning of a 
paired-associate list, report the use of 
mediational links between the stimulus 
and the response, rather than a direct 
association between them (e.g., Bugel- 
ski, 1962). This would seem to indicate 
that the learning observed in these ex- 
periments involves largely a transfer of 
past learning rather than the acquisition 
of a de novo association. But these 
mediators must have been acquired at 
some time in the history of the subject 
and research with children may shed 
some light on this problem. 

Finally, the extension of theories 
based on the learning of college students 
to the learning of children would seem 
both an important and a necessary step 
in the field of verbal learning. This point 
is not a contradiction of the first point, 
however, since the issue here involves 
tests of theory and application of find- 
ings based on college students to younger 
and possibly older members of the same 
species. 
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A PERVASIVE PROBLEM FOR PSYCHOLOGY * 
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After a brief historical introduction and a discussion of methodological 
issues, this substantive review concentrates upon the relevant literature 
of the past 5 years. Special attention is given to the writer’s own experi- 
ments on the effect of subliminal anchors upon psychophysical judgments. 


(86 ref.) 


A review of the history of modern psy- 
chology yields relatively few general 
principles of marked significance. Among 
these few there are two the importance 
of which is generally recognized: mental 
events are capable of measurement and 
behavior may be determined by immedi- 
ate variables of which the behaving indi- 
vidual is essentially unaware. The first 
of these is identified with Fechner. 
Though the notion of measurement of 
mental phenomena goes back at least as 
far as Herbart and Weber, it was 
Fechner who first formulated a general 
expression for the psychophysical rela- 
tionship and who codified a set of pro- 
cedures by which this could be tested. 
The second, the notion of the uncon- 
scious, is identified with Freudian psy- 
chology. Unlike the former, which re- 
flects the rational approach of nineteenth 
century laboratory science, it resulted 
from clinical intuition and has failed 
general acceptance among psychologists 
until the relatively recent past. Indeed, 
theoretically oriented experimental in- 
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entitled, “The Evolution of Perceptual Frames 
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vestigations of the unconscious date only 

from the past 1 or 2 decades. 
Meanwhile, curiosity about the possi- 
ble influence of subthreshold stimulation 
has been in evidence for some time. Ac- 
cording to Baker (1937), the earliest 
experiment is that of Suslowa, who 
studied the effect of weak electrical 
stimulation upon the two-point thresh- 
old. Unfortunately, a variety of situa- 
tions has been used in these studies and 
the definition of subliminal stimulation 
remains unclear. In some, the critical 
stimuli are ostensibly below the limits of 
detectability (absolute threshold). An 
example of this sort of experiment is 
the recent study of Black and Bevan 
(1960) in which subliminal stimuli were 
introduced prior to the presentation of 
supraliminal .series stimuli in order to 
determine possible effects of the former 
upon the latter. In others, the critical 
stimuli closely resemble or are masked 
by other elements of the stimulus com- 
plex such that confident discrimination 
is not possible, This type of study may 
be illustrated by Day’s (1956) study of 
serial nonrandomness in the auditory 
differential threshold. Finally, in a third 
class of situations, the critical stimuli 
are available for perception but lie out- 
side the field of attention. Here the 
81 
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verbal conditioning study of Greenspoon 
(1955) provides a convenient example. 
The “New Look” Approach to Sub- 
liminal Perception. Interest in sub- 
liminal perception during the last decade 
and a half has been dominated by a 
concern for perceptual defense. The 
pioneer experiment on this phenomenon 
is the McGinnies (1949) “dirty-word” 
experiment. In this study taboo words 
had elevated recognition thresholds and 
evoked GSR responses prior to the cor- 
rect report of them by the subject. A 
variety of criticisms has been made of 
this and similar studies. It has been 
argued that the difference in threshold 
between neutral and taboo words re- 
flects a difference in familiarity (Solo- 
mon & Howes, 1951) and that, in the 
case of taboo words, the subject may be 
suppressing rather than repressing his 
response. An attempt to untangle these 
problems has given rise to experiments 
like that of Lazarus and McCleary 
(1951) who simulated the anxiety re- 
sponse associated with the taboo words 
by a pretest application of conditioning 
shock along with the presentation of 
certain nonsense syllables. The sub- 
sequent recognition thresholds for these 
taboo syllables were reliably higher than 
for other neutral syllables, although 
there was no difference in familiarity 
among the syllables nor was there any 
reason to expect suppression. 
The theoretical antithesis of percep- 
tual defense is selective sensitization. 
If anxiety is expected to raise thresholds, 
positive values are said to lower them. 
Thus, McClelland and Liberman (1949) 
found that subjects with high need 
achievement scores on the TAT had low 
recognition thresholds for achievement- 
related words, while low achievers more 
quickly recognized security-related 
words. But the results on sensitization, 
like those on defense, are not completely 
consistent. For example, Gilchrist, Lude- 


man, and Lysak (1954) reported an in- 
crease in threshold for both positively 
and negatively toned words; and, indeed, 
there is much everyday evidence to sup- 
port the common-sense expectation that, 
under certain circumstances, the appro- 
priate response to a threatening stimulus 
is vigilance and a lowered threshold. A 
major problem lies in differentiating the 
circumstances calling for defense, on the 
one hand, and vigilance on the other. 
It is interesting to speculate about the 
difference between stimuli that threaten 
the psychological integrity of the per- 
ceiver—his own good impression of him- 
self and his physical integrity—his 
safety or comfort. This is, perhaps, a 
difference between anxiety and fear or 
between panic and pain. One may an- 
ticipate that fear or pain, in contrast to 
anxiety or panic, are associated with 
vigilance. Certain of the data to be 
described later appear to be in line with 
this supposition. 

Eriksen (1952) early emphasized the 
importance of considering between-sub- 
ject differences in the design of experi- 
ments on subliminal stimulation. An 
experiment by Carpenter, Weiner, and 
Carpenter (1956) illustrated this impor- 
tance. Subjects classified as sex sensi- 
tizers, or as hostile, perceived sex-related 
or hostile stimulus words more readily 
than sex repressers or nonhostile sub- 
jects. Recognition of the importance 
of individual differences points to a 
source of weakness in many studies of 
subliminal stimulation. Common prac- 
tice has been to identify as subliminal 
any stimulus magnitude that is less than 
either the absolute or the differential 
threshold. But a precaution is in order. 
The threshold is a statistical estimate 
and for a stimulus to be consistently sub- 
liminal, it must lie outside of the popula- 
tion of values from which the threshold 
is derived. A single supraliminal re- 
sponse to a presumed subliminal stimu- 
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lus forces recognition of the fact that 
some indeterminate number of failures 
to perceive it are false negative re- 
sponses. And, as Goldiamond (1958) 
and Eriksen (1956b) observed, if false 
failures at recognition occur in sufficient 
number, they will create a false impres- 
sion of a subliminal effect. 


METHODOLOGICAL CONSIDERATIONS 


From its inception, the recent work 
on perceptual defense has been plagued 
by methodological problems. It was 
early objected that threshold differences 
between taboo and neutral words re- 
flected differences in familiarity rather 
than defense (e.g., Howes & Solomon, 
1950). More fundamental is the ques- 
tion of perceptual defense versus re- 
sponse suppression. For some time, the 
best that has been done has been to sub- 
ject the subject to posttest interrogation 
about his experience. Meanwhile, Gar- 
ner, Hake, and Eriksen (1956) have 
proposed the principle of converging 
operations as a device for differentiating 
among intervening processes and have 
applied this to the problem of subcep- 
tion. To demonstrate a perceptual dis- 
crimination below the level of awareness, 
several experiments are necessary, even 
if one equates consciousness with verbal 
report. Under these circumstances it is 
necessary to demonstrate a higher cor- 


relation between stimulation and GSR 
(or some other physiological index) than 
between stimulation and all verbal re- 
sponse systems that can be identified. 
Edwards (1960) has shown that 
recognition thresholds for tachistoscop!- 
cally presented words vary with the 
response method employed (multiple- 
choice, discerning guess, and free verbal- 
ization) and has concluded that sub- 
liminal perception is a methodological 
artifact. Granting a relationship be- 
tween form of response and recognition 
threshold, Edwards’ conclusion 1S not 


warranted unless it is demonstrated that 
nonverbal response systems fail to cor- 
relate with stimulation. Meanwhile, 
Mathews and Wertheimer (1958) had 
already proposed a pure measure of de- 
fense independent of response suppres- 
sion. In this, subjects are given a list 
consisting of an equal number of neutral 
and emotion-provoking words. These 
are all identified as words to be tachisto- 
scopically presented. In fact, only half 
the words listed are presented. By sub- 
tracting a performance measure for the 
“stimulus absent” words (which repre- 
sent response bias) from the “stimulus 
present” words (which relate to both 
perceptual and response tendencies), one 
arrives at an estimate of “pure” per- 
ceptual tendency. A comparison of these 
difference scores for the neutral and the 
emotional words is then assumed to 
indicate the operation of perceptual de- 
fense. Mathews and Wertheimer re- 
ported such a demonstration for subjects 
scoring high on the Hysteria scale of the 
MMPI but failed to find similar evidence 
in the responses of subjects scoring high 
on the Psychasthenia scale. 

The stereoscope has been used in two 
experiments (Davis, 1959; Van de 
Castle, 1960) as a device for studying 
perceptual defense. The technique in- 
volves presenting different material 
(words or pictures) to each eye and re- 
cording what the subject reports. This 
would seem at first hand to be an excel- 
lent solution to the defense-suppression 
problem, since it deals with supraliminal 
material. However, it always allows for 
a neutral alternative in response, and 
while the perceptually hypersensitive 
subject may be readily identified, it 1s 
not possible to identify the defender as 
defender or suppressor. Furthermore, 
the responses are not always “either-or. 
While the subject may report either the 
emotionally toned or the neutral stimu- 
lus, he may also see both, or parts of 
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both, or even some integration of the 
two. Van de Castle (1960) has proposed 
to resolve this problem by arbitrarily 
ordering and assigning numerical weights 
to each of the possible responses. Fi- 
nally, the subject’s response is influenced 
by eye dominance and it is not Possible 
to correct for this factor simply by coun- 
terbalancing the position of the emo- 
tionally toned stimulus. 
Examination of the literature on sub- 
ception and perceptual defense reveals 
no major methodological breakthrough, 
with the exception of the principle of 
convergent operations, and this latter 
has yet to be translated into a particu- 
Jar set of empirical procedures in the 
service of this area of inquiry. Eriksen 
(1958), in his Nebraska symposium 
paper, has performed an exhaustive 
methodological analysis of the broad 
spectrum of studies of discrimination 
without awareness and concluded that 
there is yet no empirical basis for infer- 
ring the existence of a superdiscrimina- 
tory system. Rather, he proposes that 
when the stimulus is extremely weak, 
the Subject may respond to fragmentary 
cues, with these cues indicating the pres- 
ence of a stimulus for one response sys- 
tem and not for another, In the case of 
subception, he suggested that the typical 
experimental conditions do not allow 
the subject to state verbally all the dis- 
crimination he is capable of making. In 
a 1962 paper Davis and Hess Proposed 
to measure awareness by rating the sub- 
ject’s responses to open-ended questions 
following a series of anagram problems, 
Here, of course, awareness is equated 
with verbal report and the procedure ex- 
cludes the possibility of relevant proc- 
esses that do not fit this category. 


RECENT LITERATURE ON SUBLIMINAL 
PERCEPTION 


Despite the nagging of methodological 
issues, a lively interest in subliminal 


perception persists to the present. In the 
past 5 years there have been, in addition 
to Eriksen’s (1958) methodological re- 
view, another primarily methodological 
review (Goldiamond, 1958), several sub- 
stantive reviews (Adams, 1957; Jenkins, 
1957; McConnell, Cutler, & McNeil, 
1958), the report of a symposium (Erik- 
sen, 1962), and a monograph (W. P. 
Brown, 1961). The topic has received 
special attention in introductory text- 
books and in recent texts on motivation 
(e.g., J. S. Brown, 1961; Hall, 1961). 
The Brown monograph (1961) com- 
prises a comprehensive treatment of the 
phenomena associated with perceptual 
defense, including an extensive substan- 
tive review of the literature, a discussion 
of both the definitional and methodolog- 
ical problems associated with the 
phenomenon, a critical evaluation of 
five theoretical approaches, a detailed 
presentation of one of the author’s own 
experiments, and an exposition of his 
own theoretical orientation. In the 
period between Brown’s latest literature 
citation and the summer of 1962 at least 
34 experimental studies have appeared. 

That the topic has come of age is in- 
dicated by several facts. Experiments 
are no longer simply directed toward 
establishing subliminal perception as an 
empirical phenomenon but incline more 
toward examining its influence upon a 
variety of behavioral events. In addi- 
tion, there has been interest in the 
commercial exploitation of the principle 
of subliminal perception. In this section 
we shall be concerned Primarily with 
summarizing the results of this recent 
empirical literature. 

Early in the 1950s interest in sub- 
liminal preception was concentrated 
mainly on perceptual defense. Experi- 
menters treated subliminal perception, 
for the most part, as the dependent 
variable and concern centered about the 
identification of the variables, primarily 
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stimulus variables, that influenced it. 
Subliminal perception was identified in- 
directly by the occurrence of physiologi- 
cal change suggestive of emotional 
reaction and accompanying a raised 
recognition threshold for emotion-pro- 
voking stimuli. Both the physical and 
the behavioral properties of stimuli were 
manipulated as independent variables. 
Early in the 1960s, interest in subliminal 
perception appears to have shifted to- 
ward its relationship to central—par- 
ticularly personality—variables and to 
its role as an independent variable 
affecting other behavioral processes. 

Stimulus Determinants of Perceptual 
Thresholds. Three characteristics of 
stimuli—their capacity for affective 
arousal, their familiarity or frequency 
of occurrence, and their physical inten- 
sity—still provoke experimental interest. 

Spence (1957) has proposed that 
anxiety related to threatening stimuli 
disrupts prerecognition hypotheses near 
threshold to produce either an elevation 
or a lowering of the recognition thresh- 
old, and his experimental results have 
indicated no systematic trend toward a 
defensive, in contrast to a vigilant, re- 
action to anxiety-related words. 

In order to meet methodological 
criticisms of the Lazarus and McCleary 
(1951) subception experiment, Dixon 
(1958b) not only measured recognition 
thresholds for emotional and neutral 
words but required his subjects to guess 
the identity of subliminally presented 
words, to give free-association responses 
to these stimuli, and then to match their 
guesses to the stimuli they thought had 
been presented. While no reliable in- 
crease in latency was noted for the 
emotional words, the emotional words 
Provoked larger GSRs and five of ew 
Seven subjects matched their guesses to 
the stimuli that accompanied them more 
frequently than could be expected z 
chance. Questioning at several stages 0: 


the experiment revealed no evidence of 
awareness of the subliminally presented 
stimuli. Fuhrer and Eriksen (1960) re- 
peated Dixon’s experiment, but were 
unable to obtain better than chance 
matches between presented words and 
guessed responses. When the experi- 
ment was extended to include a group 
receiving the stimuli at a higher illumi- 
nation level and another given the 
stimuli backwards and upside down, 
better than chance matches were ob- 
tained. These latter results, however, 
would appear to be attributable to 
structural differences among the stimuli. 
Meanwhile, Eriksen, Azuma, and 
Hicks (1959) reported that subjects can 
identify the affective quality of a stimu- 
lus word reliably when they fail at 
actual recognition, although there was 
no evidence of a lower discrimination 
threshold for affective judgments than 
for specific identifications. This they 
took as evidence supporting Eriksen’s 
(1958) concurrent response theory. 
Lowenfeld (1961) has objected that 
the experimental procedure in studies of 
subception and defense have not met the 
conditions of accompanying theory, 
since they have dealt primarily with 
responses to noxious stimuli or stimuli 
rendered noxious. More representative 
studies, he argued, would involve nega- 
tive affect resulting from the associa- 
tive connotations of stimuli rather than 
any noxious quality per se. Accordingly, 
prior to performing the subception ex- 
periment, he induced negative affect into 
his emotional stimuli by pairing them in 
a nonsense-syllable exercise with nega- 
tively reinforced response syllables. His 
data supported the occurrence of sub- 
liminal perception under these condi- 
tions. A 
Using the technique of stereoscopic 
perception rather than the threshold 
technique, Davis (1959) reported that 
emotionally toned words as well as less 
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familiar (ie., less frequently used) 
words were perceived less frequently 
when presented simultaneously with 
neutral or familiar words. There was, 
however, no significant interaction be- 
tween these two stimulus variables. 

In the Dixon (1958b) experiment 
described above the methodological goal 
was to remove response bias associated 
with the design of the subception experi- 

ment. Taylor, Rosenfeldt, and Schulz 
(1961) attacked the same problem in 
the design of word frequency experi- 
ments with a variant of the forced- 
choice technique. They required that 
subjects indicate the spatial position of 
key words and found, with response bias 
controlled, frequency of usage to be 
related to ease of recognition. 

In another experiment Dixon (1958a) 
used the method of closed-loop control 
to differentiate the perceptual threshold 
from the threshold for verbal report. 
In this procedure, the subject kept a 
continuous threshold setting of a spot 
of light projected into the left eye. 
Verbal stimuli, emotional and neutral, 
were always presented to the right eye 
at intensities below this value. Generally 
higher thresholds for the left eye were 
found to accompany taboo words. 

These recent experiments on stimulus 
determinants of word recognition con- 
tinue the attempts of earlier studies to 
clarify the status of subliminal percep- 
tion as a possible methodological arti- 
fact. Although results are sometimes 
inconsistent, the appearance of the effect 
appears to persist through each meth- 
odological refinement. 

Subliminal Perception as the Percep- 
tion of Partial Cues. The view of per- 
ception that has emerged from the lit- 
erature on subliminal processes 1s that it 
is an extremely complex response organ- 
ized in hierarchical fashion, some facets 
capable of appearance prior to recog- 


nition or verbal identification. Mean- 


WILLIAM BEVAN 


while, as indicated earlier, Eriksen 
(1956b) has contended that what is iden- 
tified as subliminal perception in the 
laboratory may result from restrictions 
placed by the experimental design on the 
available number of verbal responses to 
word stimuli. Boardman (1957) ex- 
tended this prerecognition hypothesis 
by arguing that subjects in the subcep- 
tion experiments are capable of discrimi- 
nating structural differences between 
tachistoscopically presented shocked and 
nonshocked words below verbal recogni- 
tion levels. He further contended that 
observed differences in GSR reflect this 
discrimination of elements of partial 
cues. He demonstrated that almost 
half of the prerecognition hypotheses 
concerning taboo words contained two 
or more letters of the four-letter word 
presented, while neutral dissimilar words 
were seldom identified as taboo or taboo- 
similar words. In contrast, neutral words 
structurally resembling taboo words were 
frequently confused with the latter as 
prerecognition hypotheses. 

Goldberg and Fiss (1959) suggested 
that this “partial information” hypothe- 
sis applies to the “discrimination with- 
out awareness” literature as well as to 
the subception data, and found with 
tachistoscopically exposed geometric 
figures that partial cues facilitated per- 
formance, although when the stimulus 
was only a blur, the frequency of correct 
identifications did not exceed chance. 
They therefore interpreted discrimina- 
tion without awareness as a laboratory 
artifact resulting from a difference be- 
tween the amount of information the 
subject recognizes he has available and 
his guess about the identity of the stimu- 
lus object. 

In a 1960 study of the perception of 
partial cues, Weiner and Schiller (1960) 
found that the number of correct iden- 
tifications of ambiguous figures was 
directly related to the stimulus intensity 
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of the identifying cue. They further 
strengthened the case for the perception 
of partial cues by demonstrating that 
the GSR generalization gradient in sub- 
liminal perception is related to the 
structural similarity in contrast to the 
semantic relatedness of verbal stimuli 
and that the amount of recognition that 
occurs in subliminal perception relates 
to the amount of peripheral stimulation 
the subject can perceive. Their finding 
dealing with the GSR gradient contrasts 
with Spence’s (1961) report that sub- 
liminal words may facilitate the con- 
scious discrimination of synonyms as 
well as stimulus words themselves and 
points out that the extent and type of 
generalization also depend, among other 
things, upon the response measures used. 

Subliminal Perception and Phenomena 
of Learning, Identifying the role of 
partial cues in subliminal perception 
prompts an examination of these phe- 
nomena in terms of learning variables. 
Boardman (1957) reported that the 
number of correct elements in a series 
of prerecognition hypotheses increases 
over trials. This indicates a progres- 
sive lowering of the threshold for 
elements with practice and dictates the 
need for a methodological control to 
insure that stimuli are in fact subliminal 
when they are presented. Meanwhile, 
Dulany (1957) suggested that the com- 
plementary modes of perceptual response 
—vigilance and defense—can be recon- 
ciled as the result of avoidance learning. 
Perceptual defense is held to be a learned 
response produced by the punishment of 
a critical response while competing Te 
sponses are instrumental to avoidance of 
punishment and consequent anxiety re- 
duction, Vigilance results from the op- 
posite set of circumstances. 

In an experiment to test his n 
he had subjects rate recognizability of 
simple geometric figures; he shocked 
some whenever a critical stimulus aP- 


s notion, 


peared clearest and others whenever any 
noncritical figure was presented. In 
posttraining recognizability tests a pre- 
dominant number of “defense” subjects 
displayed a reduction in clarity of the 
critical figures while a comparable num- 
ber of “vigilance” subjects showed the 
opposite tendency. Similarly, Banks and 
Walters (1959) found that verbal rein- 
forcement of taboo words in a guessing 
game resulted in reduced recognition 
thresholds for this class of stimuli in 
recognition tests. In contrast, Goldstein 
and Himmelfarb (1962) reported that 
tachistoscopically presented, emotionally 
toned words were less frequently recog- 
nized than neutral words, and that 
knowledge of results on the first 25% 
of the trials had no effect on perform- 
ance. 

Meanwhile, Lowenfeld, Rubenfeld, 
and Guthrie (1956) reported that the 
GSR response identified with subception 
in the Lazarus and McCleary (1951) 
experiment cannot be inhibited by verbal 
instructions as long as the reinforced 
stimuli remain below the recognition 
threshold. In a sequel to this study of 
verbal inhibition, Rubenfeld, Lowenfeld, 
and Guthrie (1956) examined the possi- 
bility of stimulus generalization in sub- 
ception. Subception effects were ob- 
tained and stimulus generalization of 
GSR responses occurred to a greater 
extent when figures were confused in 
identification than when they were ac- 
curately discriminated. 

Set and the Recognition Threshold. 
It is amply clear that recognition 
thresholds are not solely a function of 
the properties—either physical or psy- 
chological—of the particular stimulus 
giving rise to the measurement. As 
Boardman (1957) reported, the accuracy 
of prerecognition hypotheses on indi- 
vidual trials increases as a test-sequence 
proceeds, so that what may be initially a 
blur during tachistoscopic presentation 
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early in an experimental series may be 
quite clear at the same speed later on. 
Thus test attitude, identified with task 
familiarity, may be an important con- 
sideration in the correct interpretation 
of defense and other subliminal phe- 
nomena. The effects of attitudinal factors 
are seen in other ways. If a class of words 
is identified by the subject during the 
test series, expectations resulting from 
this are likely to produce confusion in 
the prerecognition hypotheses. In this 
connection, the presence of offensive 
words in Boardman’s experiment led to 
the use of taboo-word hypotheses with 
neutral stimuli of similar physical ap- 
pearance. Here one has an elevation of 
threshold quite the opposite of that 
found in defense experiments, where the 
recognition of a taboo word is withheld 
by the persistence of neutral-word 
hypotheses. 

The directive effect of sets where the 
result is one of enhancing perceptual 
acuity is also seen in an experiment by 
Hoisington and Spencer (1958). Lists 
of words were presented at a distance 
such that no words on any list could be 
correctly identified. However, when the 
subject was given one word from each 
list and asked to locate its position in the 
list it fairly “popped” into clarity. This 
is suggestive of the role of partial cues in 
recognition. When the subject knows 
what to look for he has less difficulty in 
organizing his perceptual field so as to 
recognize the critical stimuli. 

Before such effects can be unequivo- 
cally identified as effects upon perception 
their relation to response bias must be 
clarified. As noted earlier, a number of 

students of the defense phenomenon 
(€g. Eriksen, 1958; Goldiamond, 1958) 
have proposed to identify this with re- 
sponse bias rather than with perceptual 
inhibition. In their view elevated recogni- 
tion thresholds for taboo words we 
from the smaller likelihood that subjects 
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in the formal laboratory situation will 
entertain prerecognition hypotheses for 
the category of taboo words. Goldstein 
(1962) and Goldstein, Himmelfarb, and 
Feder (1962) have reported several ex- 
periments that are directed toward segre- 
gating the roles of the perceptual and 
response variables. In one, subjects were 
required to identify flashed stimuli 
(either nonsense syllables or words) 
from a word list of both neutral and 
anxiety-arousing words. Subjects given 
word stimuli displayed a defense effect, 
correctly identifying fewer emotionally 
toned than neutral words. However, 
subjects tested with nonsense syllables 
showed the same defense effect, prompt- 
ing the conclusion that defense may 
result from response bias. In the other 
experiment, designed to eliminate re- 
sponse bias, no evidence of defense was 
apparent. While this result may seem to 
support the identification of defense with 
response bias, it may also be attrib- 
utable to a blocking of the anxiety- 
reducing function of defense by the 
forced-choice experimental method used. 

The need to differentiate between per- 
ception and response bias is nicely 
illustrated by an experiment by Goldia- 
mond and Hawkins (1958). These 
experimenters required subjects to learn 
nonsense syllables and then take a visual 
recognition test. During this latter test 
no words were in fact flashed on the 
screen, but “recognition thresholds” were 
found to be related to the training fre- 
quency for the nonsense syllables. 

A recent experiment of Phares (1962); 
reminiscent of the Lazarus and McCleary 
(1951) study, relates to the anxiety- 
reduction hypothesis of defense. When 
pretest avoidance learning was possible 
because of a systematic relation between 
the presentation of shock and certain 
nonsense syllables, recognition thresh- 
olds for these syllables were lowered. In 
contrast, when avoidance learning was 
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thwarted by a random relationship be- 
tween the presentation of syllables and 
shock, recognition thresholds showed no 
significant drop. Furthermore, this be- 
tween-conditions difference in recogni- 
tion threshold held equally well for 
shocked and nonshocked syllables, sug- 
gesting a generalization of the anxiety- 
reduction effect. Here, however, defense 
would appear to be identified with the 
failure of anxiety reduction. 

Affect and Personality. The early 
experiments on defense and subception 
were directed toward exploring the role 
of the affective or affect related prop- 
erties of stimuli upon perception. The 
articulation of this line of research seems 
frequently to have been interfered with 
by methodological issues and the intru- 
sion of other variables. In one of the 
few studies in the last few years con- 
cerned directly with affect, Lowenfeld 
(1961) describes experimental analogues 
of repression, subception, and defense 
resulting from negative affect induced 
into verbal stimuli through applying 
shock to corresponding response stimuli. 
This latter technique extends that of 
Lazarus and McCleary (1951), in that 
it probably simulates the operations by 
which affective tone is induced outside 
the laboratory. 

Related to this interest in the mecha- 
nisms by which affect relates to the 
subliminal phenomena are considerations 
of individual differences and personality 
variables. Pustell (1957) found striking 
sex differences in response to threat 
stimuli. Males showed @ predominant 
tendency to display vigilance in the face 
of threat stimuli, while females displayed 
a less marked tendency toward defense. 
To explain this Pustell fell back upon a 
suggestion of Dollard and Miller that 
when stimulation is mild it functions as 
a cue whereas when it is more intense it 
acquires drive properties. By assuming 
that his female subjects experienced 


more intense anxiety during testing, he 
saw defense as a means of reducing an 
unpleasant drive. Vigilance is the male 
response to what was for them a less 
severe stress-cue. W. P. Brown (1961) 
experimentally indicated that sex differ- 
ences in the defense response may be 
quite complex, and Van de Castle 
(1960), using the binocular-rivalry re- 
sponse, reported that sensitizers and 
defenders, as identified by performance 
on the Welsh A and R scales, differed in 
the readiness with which they perceived 
aggressive words when pitted against 
neutral words. These two classes of 
subjects also differed in their responses 
to the Rorschach. 

Subliminal Stimulation and Percep- 
tion and Judgment. The methodologi- 
cal problems that face the experimenter 
when using subliminal stimulation as 
an independent variable are essentially 
those that have been described for 
the literature on subliminal perception. 
Thus, many experiments have been 
arrested at the point of evaluating 
whether or not such stimulation can 
have an effect upon behavior. Mean- 
while, a variety of behavioral phenomena 
has been examined as dependent varia- 
ples. Goldstein (1960) selected six com- 
mon and similar geometrical illusions 
with distorting backgrounds as sub- 
liminal stimuli and required subjects to 
identify the shape of the focal stimuli 
from an ordered set of nondistorted and 
distorted figures. Judgments overall 
tended toward distortion although such 
effects were not always evident for 
specific illusions on selected trials. Here 
the criterion of subliminal stimulation 
was that the subject displayed ignorance 
of the presence of the distorting back- 
ground on posttest inquiry. Similarly, 
Walters, Banks, and Ryder (1959) Te 
ported that the subliminal flashing of 
taboo words prior to the presentation 
of neutral words resulted in distortion 
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or misperception of words that were 
themselves in no way taboo. 
Subliminal stimuli also influence proc- 
esses that are more clearly judgmental 
than perceptual. For example, Smith, 
Spence, and Klein (1959) indicated that 
flashing the words HAPPY or ANGRY prior 
to the suprathreshold presentation of a 
relatively expressionless face resulted in 
the verbal descriptions of the face as 
either pleasant or unpleasant. In a re- 
lated study, Klein, Spence, Holt, and 
Gourevitch (1958) presented drawings 
of sexually ambiguous human figures 
preceded by subliminally flashed draw- 
ings of male or female genitals or male 
or female symbols, Here, subjects were 
required to draw the figures they had 
been presented as well as make adjective 
check list ratings concerned with the 
masculinity-femininity of the figures. 
Subliminal presentation of genital draw- 
ings influenced both check list and draw- 
ing responses toward the sex indicated 
by the flashed figure, but the symbolic 
figures influenced only the masculinity 
or femininity of the drawings, 
genre Stimulation and Guessing 
e Behavior. Closely related to 
the operations of detection and discrimi- 
nation are the functions of guessing and 
choice. The literature on “discrimina- 
tion without awareness” has essentially 
consisted of demonstrations of nonran- 
dom series of responses to stimuli below 
threshold, and, in the recent past, there 
has been a practical interest in the pos- 
sibility that weak or unobtrusive stimu- 
lation would prompt certain kinds of 
choice behavior such as buying refresh- 
ments at the movies or selecting certain 
brands of merchandise above others, 
This latter interest has given rise to a 
number of experiments devised to test 
the claims of subliminal advertising. A 
study by Champion and Turner (1959) 
is representative. During the showing of 
a film on sales administration, a drawing 


of a spoonful of rice with the caption, 
“Wonder Rice,” was flashed periodi- 
cally. Control subjects were shown a 
random array of straight lines on a simi- 
lar schedule. After the film, the slide 
of the spoon was shown and the sub- 
jects were asked to indicate if they had 
ever seen it as advertising and what 
brand they thought it represented. The 
results of the experiment on both counts 
were negative. 

In order to achieve laboratory con- 
trol better, Calvin and Dollenmayer 
(1959) used simplified stimulus ma- 
terials and turned the task into a guess- 
ing game by presenting the subject on 
successive trials with instructions, 
“choose right” or “choose left,” flashed 
subliminally on the screen, Again, as 
long as the instructions remained sub- 
liminal, they failed to influence choice 
behavior. Negative results were also 
obtained by an experiment by Vernon 
and Badger (1959). 

In contrast, Schiff (1961) reported 
positive results when subjects were di- 
rectly engaged in a guessing experiment 
in which the experimental group received 
subliminal flashes of geometrical figures 
to be guessed and the control group got 
blank flashes. Although all presenta- 
tions of the figures were below each 
experimental subject’s lowest measured 
absolute threshold, this experimental pre- 
caution does not entirely compensate for 
the possibility that recognition thresh- 
olds drop with repeated testing and that 
the threshold of verbal indentification 
may not be coextensive with perception. 
Sharp (1959) did a subliminal stimula- 
tion experiment under the guise of test- 
ing subject-matter knowledge. Either 
correct or incorrect answers were pre- 
sented subliminally as the experimental 
condition. While the cues were reported 
to influence performance significantly, 
it was also found that 60% of the sub- 
jects learned to detect their presence 
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consciously. Furthermore, Giddan and 
Eriksen (1959) have demonstrated that 
unobtrusive patterns of reinforcement 
can induce systematic response biases in 
discrimination behavior just as they can 
in verbal conditioning. 

Still another study is that of Spence 
and Bressler (1962) who flashed the 
word House, then a blurred array of 
nonsense words. The subject was then 
presented a list of frequent and infre- 
quent associates to HOUSE, as well as a 
set of control words, and asked if any 
of them were in the blurred array. Dif- 
ferent groups received the key word 
subliminally, near threshold, and supra- 
liminally, Results indicated that reac- 
tion time was negatively correlated with 
associative strength for the subthreshold 
condition, but failed to relate to associa- 
tive strength for the other two condi- 
tions. 

More complex tasks have been used 
to examine the effect of subliminal stim- 
ulation upon cognitive functions. 
Kolers (1957) used metacontrast to pre- 
sent subliminal cues for the solution of 
two types of perceptual problems. In 
one, three sets of geometrical figures 
were flashed simultaneously and the 
subject was to identify a figure com- 
mon to all sets. (In fact, there were 
two figures common to all sets and the 
subliminal cue consisted of one of these.) 
In the other, the subject had to identify 
in one set of embedded figures a figure 
common to all members of another. 
(Here the cue was the embedded figure.) 
In the case of the first task, experimental 
subjects identified the subliminal figure 
as correct more frequently than did the 
controls, In the second task, experi- 
mental subjects solved more problems 
correctly than did the controls. In a 
later study Gerard (1960) replicated 
the Kolers experiment with the added 
condition of subliminal presentation of 


incorrect solutions, His results contra- 


dict those of Kolers and defy ready ex- 
planation, for both experimental groups 
(correct and incorrect cues) made fewer 
correct responses than did the control 
subjects who received no cues. 

The literature which has appeared 
since the McConnell (1958) review con- 
firms its conclusion. The influences of 
subliminal stimulation upon preference 
and choice, if they occur at all, are 
highly subtle and the possibility that 
they could constitute an effective means 
of controlling consumer behavior or 
political opinion is highly unlikely. Simi- 
larly, the technique of incidental rein- 
forcement of the verbal conditioning 
studies appears less potent than the 
early studies of this phenomenon have 
suggested. For example, Krieckhaus 
and Eriksen (1960) studied the effect of 
verbal reinforcement in the choice of 
verbal alternatives. While an effect was 
obtained in subjects unaware of the 
relationship between choice and rein- 
forcement, it was much less than that 
found in subjects aware of this relation- 
ship. Furthermore, the effect produced 
failed to generalize to any other aspect 
of the test stimuli. 

Subthreshold Stimulation and Moti- 
vation or Affect, While it would appear 
that subthreshold stimulation has at best 
an unreliable effect upon choice as a 
generic class of behavior, it is conceiva- 
ble that it may influence motivation or 
affect, variables related to choice be- 
havior. Two recent experiments bear on 
this point. Byrne (1959) demonstrated 
that experimental subjects receiving €x- 
perimental flashes of the word, “beef,” 
during an instructional film rated them- 
selves as hungrier than did control sub- 
jects. Subsequent to the film, the experi- 
mental subjects showed no tendency to 
make verbal references related to the 
subliminal stimulus in sentence comple- 
tion or word association tests and no 
greater tendency to prefer objects re- 
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lated to the key word in a preference 
test. Hence, subliminal stimulation may 
influence drive arousal if not recognition 
thresholds. 

On the other hand, subjects in an ex- 
periment by Goldstein and Davis (1961) 
who were unaware of the words, “good” 
and “bad,” flashed beneath photographs 
failed to be influenced by them whereas 
subjects who saw the words were af- 
fected. 

Subthreshold Stimulation and Elabo- 
rative Thought. Three recent experi- 
ments bear on this problem. All provide 
evidence supporting a subliminal effect. 
In a study in which subjects gave re- 
sponses to TAT cards while negatively 
or positively toned verbal cues were ta- 
chistoscopically presented, Goldstein and 
Barthol (1960) found that when the test 
cards were sufficiently ambiguous, a 
significant influence of the words was 
apparent. Zuckerman (1960), in an ex- 
periment also involving TAT cards, gave 
subliminal instructions to either write 
more or less. While control subjects 
showed a steady increase in amount pro- 
duced for consecutive cards, experi- 
mental subjects showed a decrease with 
a change In Instructions from more to 
less. Pine (1960) had subjects freely 
elaborate stories and found that the con- 
tent of the productions could be in- 
fluenced by a passage heard through the 
wall of an adjacent room. This effect, 
however, was less pronounced than when 
the passage had been read by the sub- 
ject himself. 

Subliminal Stimulation and Autistic 
Responses. Two studies are concerned 
with this relationship. In a 1958 experi- 
ment, Shevrin and Luborsky reported a 
confirmation of the Poetzl phenomenon, 
the tendency for tachistoscopically pre- 

sented material to appear in dreams 
when not available in intentional recall. 


These experimenters found a greater 
number of conceptually similar items in 
dream material than in earlier attemp 
at recall. Johnson and Eriksen (1961), 
questioning the appropriateness of the 
criterion of conceptual similarity as well 
as the use of a cohesive scene as stimulus 
material, did a similar experiment with 
what they regarded as stricter experi- 
mental controls and failed to confirm the 
Shevrin and Luborsky results. 

In Summary, A broad glance across 
these recent data supports the conclu- 
sion that subliminal perceptual effects 
are real effects, though so sensitive in 
production as not to be seriously con- 
sidered as a technique to be exploited in 
the interest of consumer sales or other 
practical goals. It is also clear that 
certain results are artifacts of procedure. 
Meanwhile, part of the problem involves 
a semantic—if not, indeed, a metapsy- 
chological—issue. It is interesting to see 
the classical question of levels of con- 
sciousness reappear in this context. In 
the evolution of concepts, movement of 
thought may be said to transcribe a 
helixlike path, conceptual problems be- 
ing brought full circle but at a higher 
level. Progress in the psychologist’s deal- 
ing with levels of awareness is seen in his 
application of the device of converging 
operations to this problem. But, as we 
have just noted, the problem is in part 
a semantic problem. Thresholds are 
identified in terms of arbitrarily speci- 
fied operations—for example, an in- 
tensity reported 50% of the time—and 
it is perfectly proper to define as sub- 
threshold events that occur reliably but 
that do not meet these definitional cri- 
teria. Meanwhile, what is most im- 
portant for an understanding of behavior 
is not that these effects are unconscious, 
but that stimuli, subtle beyond percep- 
tion in any ordinary sense, can influence 
a variety of response systems, 
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SUBLIMINAL STIMULATION AND JUDGED 
MAGNITUDE 


Pii are, then, three major types of 
effect associated with subliminal stimu- 
lation. One involves the evocation of 
responses that are essentially equivalent 
to those produced by supraliminal stim- 
uli. For example, a local irritant on 
the surface of the skin may produce 
scratching while a subject is preoccupied 
with another activity and unaware of 
his scratching. The second is charac- 
terized by the failure at appearance of 
responses which one otherwise might ex- 
pect to appear. The perceptual defense 
data fall into this category. The third 
type is marked by the induction of a 
change or distortion in the perceptual 
responses to a supraliminal stimulus. 
The final section of this paper is devoted 
to a review of experiments of my own 
dealing with this last effect. 

For 10 years the present writer has 
been interested in exploring the psycho- 
physical model developed by Helson 
(1959) in connection with his adapta- 
tion-level theory. Within the context of 
this theory, the organism is viewed as a 
system for taking inputs from various 
sources—stimuli, their background, and 
past experience—and combining these to 
constitute internal norms or standards 
which become the reference values for 
subsequent judgments. Thus the bright- 
ness of a light, for example, is not sim- 
ply a matter of the physical intensity of 
the light stimulus, but depends also upon 
the brightness of the background on 
which it appears and the impression of 
average brightness evolved by the judge 
from his past experience with lights. 
Preoccupation with the organism as & 
Pooling norm-deriving system forces in- 
terest in the problem of relevance. By 
relevance is meant the criteria that dif- 
ferentiate the inputs which will be pooled 
from those which are ineffective in the 


derivation of particular subjective judg- 
mental norms. Interest in the general 
question of relevance has led us to a 
consideration of many specific problems. 
The pooling model, for example, pro- 
vides a new view of behavioral con- 
stancy. Bevan and Darby (1955) were 
able to show that different frequency- 
intensity patterns of input can, within 
certain limits, yield the same indifference 
point for lifted weights. Several experi- 
ments in the writer’s laboratory have 
indicated that sense modality is not a 
limiting condition of relevance. Bevan 
and Pritchard (1963b) have reported 
that visual anchors can be used to in- 
fluence the judged loudness of tones 
and Behar and Bevan (1961), in an 
extended series on judged duration, 
found that they could not only induce, 
but they could precisely predict, hetero- 
modal anchor effects. 

In the present experiments the writer’s 
interest in relevance was extended to 
include the absolute threshold as a pos- 
sible limiting condition for pooling. Not 
only did Fechner make the magnitude of 
the present stimulus process the sole 
determinant of the corresponding sensory 
attribute, but also, in his formulation of 
the psychophysical law, S=k log R, he 
assumed that the absolute threshold is 
the origin of the sensory scale, a fixed 
and limiting value. Like Fechner, but for 
different reasons, Wundt (Boring, 1929, 
p. 332), and later Titchener (Bevan, 
1958, p. 37), held the absolute threshold 
to be the limiting value of all psycho- 
logically relevant stimuli. This position 
has been retained by Stevens (1936) and 
other psychophysicists in the classical 
tradition, but adaptation-level theorists 
(Michels & Helson, 1949) have shown 
that Fechner’s law can be derived on 
rational grounds with prevailing adapta- 
tion level as the origin of this psycho- 
physical function. The advent of a 
modern functionally oriented psycho- 
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physics, with its emphasis on the pool- 
ing of background and residual stimuli 
with focal stimuli, has made the pos- 
sibility of subliminal effects highly plau- 
sible. The experiments to be reviewed 
now sought to determine if subliminal 
stimuli influence a subject’s responsive- 
ness to supraliminal stimuli. The addi- 
tion of a subliminal stimulus to a supra- 
liminal series fits the paradigm of the 
anchor experiment. Should the sub- 
liminal stimulus behave like a supra- 
liminal anchor below the series, it may 
be expected to produce an elevation in 
judged intensity of the judgments of 
the suprathreshold series members. 
Furthermore, this change should be 
greatest for the stimuli at the lower end 
of the series; that is, closest to the sub- 
liminal anchor, 

The experiments to be described in 
the following paragraphs have two meth- 
odological features in common. In no 
instance has the subliminal stimulus 
evoked a positive response nor was its 
inclusion in the order of presentation 
known. Subjects who responded to one 
or more presentations of the subliminal 
stimulus or who, upon questioning after 
test, indicated cognizance of its presence 
were eliminated from the experiments. 

Shock Experiments, The first experi- 
ment was a pilot study. A small num- 
ber of subjects made category judgments 
of a series of mild electric shocks under 
two conditions, the experimental se- 
quence during which a subliminal shock 
was presented, without the subject’s 
knowledge, at the midpoint of the pres- 

entation interval and the control se- 
quence, without the subliminal anchor. 
When judged intensity of the series was 
plotted against its physical intensities, 
the values for three stimuli under the 
experimental condition lay, as expected, 
above corresponding control values. 
However, the average differences between 
conditions was small and failed statisti- 
cal significance. Meanwhile, there was 


a significant stimuli-by-conditions inter- 
action, indicating a difference in the 
slopes of the functions and suggesting a 
subliminal anchor effect. Further data 
analysis revealed that subjects, regard- 
less of the order of the conditions, tended 
to give more intense judgments for the 
first than for the second condition. Such 
a temporal effect might be expected to 
mask the effects of the experimental 
treatment. Order effects are a common 
bugaboo in psychophysical experiments 
when counterbalancing is used. There- 
fore, the experiment was repeated with 
an independent group for each condition. 

In Experiment II (Black & Bevan, 
1960), GSR measurements were made 
for all series stimuli and the subliminal 
anchor or the midpoint of the presenta- 
tion interval as appropriate, with the 
hope of registering the arousing effects 
of the subliminal stimulus although its 
presence went undetected. Statistical 
comparison of the experimental and con- 
trol functions revealed that the judg- 
ments of the experimental subjects were 
consistently more intense than those for 
the controls. Furthermore, the greatest 
elevation of the experimental curve oc- 
curred in the low end of the series, that 
is, nearest the subliminal stimulus on the 
intensity dimension. Thus, it would ap- 
pear that the subliminal stimulus be- 
haved like an anchor below the series. 
However, there was no evidence of more 
than the chance occurrence of GSRs 
with the presentation of the subliminal 
anchor, suggesting the possibility that 
there are circumstances for which be- 
havioral measurements are more sensi- 
tive indices of stimulation than are 
available physiological measures. It also 
suggests that the brain-stem mechanisms 
of arousal may differ from those asso- 
ciated with anxiety and perceptual de- 
fense. 

Goldstone, Goldfarb, Strong, and Rus- 
sell (1962) have recently repeated this 
last experiment with a change in proce- 
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dure designed to enhance the effective- 
ness of the subliminal anchor. Instead 
of interpolating the anchor at the mid- 
point of the presentation interval be- 
tween series stimuli, this stimulus was 
presented just prior to each series stim- 
ulus after the first. Again, the average 
judged intensity for the experimental 
subjects was consistently higher than 
that for the controls, and the mean dif- 
ference between groups was statistically 
significant. 

Generality of the Shock Results. Be- 
ing satisfied that subliminal shocks do, 
indeed, influence the apparent magnitude 
of series shocks above threshold, the 
next question of concern was whether 
or not this effect is confined to electrical 
stimulation or whether or not it is more 
general in nature. 

Accordingly, Experiment II was rep- 
licated with loudness substituted for 
shock as the dimension to be judged 
(Bevan & Pritchard, 1963a). Because 
the test room was poorly insulated for 
sound, a 70-decibel sound pressure level, 
white-noise sound screen was set up. 
Comparison of the experimental and 
control data revealed both highly signif- 
icant between-group slopes and be- 
tween-group means sources of variance. 
Curves for the two conditions were most 
widely separated, as expected, at the low 
end of the series. But, unexpectedly, 
the relative positions of the curves were 
reversed, thus contradicting both the 
prediction and the results of the early 
experiments. Two possibilities for rec- 
onciliation presented themselves. One 
was that, for the particular set of condi- 
tions selected for the experiment, the 
effect was one of assimilation rather than 
contrast, the anchor attracting rather 
than repelling series judgments. The 
other possibility was that the impressive 
sound screen, while it kept extraneous 
noises from producing anchor shifts, also 
itself served as a potent anchor above 
the series and, perhaps, interacting with 


the anchor designate, produced the un- 
expected results. 

Therefore, the experiment was re- 
peated in a specially constructed sound- 
insulated laboratory, the experimental 
subject, in addition, wearing highly ef- 
fective ear muffs to screen out any ex- 
ternal sounds that might get through 
room insulation. The data from this ex- 
periment were ambiguous. The judg- 
ments of the experimental group fell 
above those of the control for the three 
highest series magnitudes, coincided on 
the next to the lowest magnitude, and 
was below the control value on the weak- 
est of the series stimuli. However, the 
overall difference between the average 
judgments for control and experimental 
conditions was clearly significant. Thus, 
it would appear that if there were an 
anchor effect, it was a weak one. It 
would also appear that the possibility 
of an assimilation rather than contrast 
effect for the stimuli nearest the anchor 
was plausible. 

Ambiguity of these results bred dis- 
content. Hence, yet another experiment 
was performed, the purpose of which 
was twofold: to seek a more effective 
set of conditions for the production of 
the subliminal anchor effect and, as part 
of this, to explore the result of changing 
the temporal position of the anchor 
within the presentation interval. As was 
indicated earlier, Goldstone. Goldfarb, 
Strong, and Russell (1962) reported that 
while an attempt to reproduce the Black- 
Bevan results with the anchor at the 
midpoint of the interval was unsuccess- 
ful, it was possible to do so by moving 
the anchor back in time s0 that it pre- 
ceded the next series stimulus by a 
shorter interval. Following this line of 
reasoning, ted that the 


ambiguity of the last result ight | 
clarified by using several ssc 
stimulus intervals. It was Ss eae 
that the anchor effect might be age : 
by increasing the scalar distance 
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tween anchor and series. At the same 
time, this meant, of course, using a 
weaker physical stimulus as anchor. The 
control group of the last experiment was 
used as the control for the present study. 
In addition, three anchor groups were 
tested. In one, the subliminal anchor 
was presented just prior to each series 
stimulus to be judged, in the second at 
the midpoint of the presentation inter- 
val, and in the third just after each series 
stimulus. 

The results are rather interesting. In 
every case, the experimental group made 
reliably more intense judgments than 
the control. This is the contrast effect 
that was sought. It is consistent with 
the original results of Black and Bevan 
(1960). Furthermore, the position of 
the anchor within the interstimulus in- 
terval seems not to have had any effect 
upon the magnitude of the anchor effect. 
This, of course, does not mean that in- 
terpolation time is not an important 
general consideration, but only that un- 
der the particular experimental condi- 
tions of this last experiment, it failed to 
operate as a significant variable. 

An experiment similar to these audi- 
tory experiments has been reported by 
Boardman and Goldstone (1962). Their 
experimental task involved the judgment 
of the size of discs. Using the visual 
masking technique with a four-field elec- 
tronic tachistoscope, one group of sub- 
jects received the psychophysical series 
along with a subliminal anchor smaller 
than any of the series members, while 

another was presented the series and a 
subliminal anchor larger than the series 
members in size. Each subject served 
as his own control. Although the differ- 
ences between control and experimental 
conditions were small and were not 
significant for all series magnitudes, the 
results were in line with expectation. 
For the group receiving the small anchor, 
the experimental values plotted above 
the control values and the greatest differ- 


ence was at the low end of the series. 
For the group receiving the anchor above 
the series, the control values plotted 
above those of the experimental and the 
greatest spread was at the upper end of 
the series. 

Some Implications of the Data on 
Magnitude Judgment. There are now 
enough data available to prompt the 
conclusion that the subliminal anchor 
effect is a real one, Thus one is faced 
with the responsibility of reflecting upon 
its significance. These results, it ap- 
pears, have implications for two or three 
areas of inquiry. Although more ex- 
perimentation is needed, if the results 
on GSR in the electric shock experiment 
are valid, they suggest the method- 
ological rule of thumb that there are 
situations in which behavioral indices are 
more sensitive measures of mediating 
events than are the usual physiological 
measures. The fact that’ not a single 
GSR occurred to the presentation of 
the subliminal stimulus may imply that 
the brain-stem mechanisms associated 
with arousal and vigilance are distinct 
from those related to anxiety and fear. 
Perhaps, when more is learned of the 
properties of the reticular activity sys- 
tem (Jasper, 1958), these interpretations 
of the present results will receive ex- 
perimental support. 

The question concerning relevance 
which prompted these experiments is, 
of course, answered. The absolute 
threshold cannot be applied as a cri- 
terion in the identification of stimuli 
relevant for pooling. This has implica- 
tions not only for adaptation-level 
theory, but for psychophysics as well, 
for it draws suspicion to the long-stand- 
ing assumption that the traditionally 
defined absolute threshold is the limiting 
value for all sensory dimensions. This 
suggests, in turn, that sensory scales, 
even those constructed with the so-called 
ratio methods, are not absolute but rela- 
tive, since the origin of the scale is not 4 
true zero. 


A 
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Finally, it may be suggested that the 
results of these experiments may have 
some value for an understanding of how 
unconscious mechanisms exert their in- 
fluence upon behavior. The nature of 
the unconscious has been most thor- 
oughly articulated within the context of 
psychoanalytic theory. From this point 
of view, thought goes on at two levels: 
primary thought (Freud, 1946), identi- 
fied with the unconscious, is character- 
ized as essentially dynamic, directed by 
instinctual urges, amoral, and governed 
by a set of laws qualitatively different 
from the conventional logic of second- 
ary, or conscious goal-directed thinking 
(Peck, 1950). Where primary thought 
is directed toward the fulfillment of the 
pleasure principle, secondary thought 
serves the reality principle. The es- 
sential nature of the relationship be- 
tween these two levels is said to be one 
of conflict, conscious thought sitting as 
censor of the unconscious (Jones, 1948). 
The particular mechanism by which un- 
conscious stimuli (and in a formal sense, 
instinctive drives are stimuli) exert their 
influence upon overt behavior is, how- 
ever, unclear. Meanwhile, the general 
impression one gets from the Freudian 
literature is that the relationship is di- 
rect, automatic, and, in the usual sense 
of the term, irrational. The data of 
these experiments join other data in sup- 
port of the view that at least one de- 
vice by which unconscious variables in- 
fluence behavior is through their role in 
structuring the behavioral environment 
within which the behavior they influence 
occurs, Here their role is similar to that 
of supraliminal variables and the be- 
havior, far from being jrrational, is not 
only rational but appropriate to the 
circumstances as the behaving individual 
perceives them. 
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Experiments showing that food intake can be dramatically increased 
(hyperphagia) or even eliminated (aphagia) as a result of brain lesions 
or of brain stimulation are described. The most widely accepted explana- 
tion of how the brain controls feeding behavior, Anand and Brobeck’s 
dual-center theory of hypothalamic regulation, docs not account satis- 
factorily for all the data, but it does account for a great deal. Experi- 
ments are reviewed in which food intake was altered by lesions or by 
stimulation outside of the hypothalamus, and it is suggested that the 
hypothalamus is part of a larger “feeding system” that also includes 
parts of the thalamus, temporal lobe, frontal lobe, and midbrain. 


Mohr (1840) published what is gen- 
erally considered to be the first clinical 
report of pathological obesity in man re- 
sulting from a tumor at the base of the 
brain (cited by Brobeck, 1946). There- 
after, although similar cases appeared in 
the literature from time to time (see, for 
example, Paget, 1897), no particular in- 
terest was shown in localizing the effec- 
tive lesion until the early 1900s, when 
a heated dispute arose as to whether 
damage to the Pituitary or to the hypo- 
thalamus was responsible. Numerous re- 
ports of experimentally produced hypo- 
thalamic obesity in animals appeared in 
the literature between 1920 and 1940, 
but most of these (with the Possible ex- 
ception of Bailey & Bremer’s work, 
1921) were received with skepticism be- 
cause the operative procedure used made 
it unlikely that the pituitary had been 
completely spared. The difficulty was 
not overcome until the early 1930s, when 
the method of making discrete electro- 
lytic lesions by means of the stereotaxic 
instrument came into general use. At 
that time, several preliminary reports 
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appeared in which this method was used 
(Hetherington & Ranson, 1939; Ranson, 
Fisher, & Ingram, 1938), but the most 
conclusive demonstration that hypotha- 
lamic lesions alone were responsible for 
obesity was furnished by Hetherington 
and Ranson (1940)—exactly 100 years 
after the publication of Mohr’s article. 
Subsequently, the possibility that pitui- 
tary damage played even a partial role 
was ruled out by hypophysectomizing 
rats before (Hetherington, 1943) and 
after (Hetherington & Ranson, 1942a) 
hypothalamic lesions. The results showed 
that obesity occurred after hypothalamic 
lesions whether the pituitary was intact 
or not, and removal of the pituitary in no 
way altered the hypothalamic symptoms. 

Once the pituitary-hypothalamic con- 
troversy had been resolved, most in- 
vestigators interested in the problem of 
neural control of food intake concen- 
trated their efforts on elaborating the 
hypothalamic syndrome, and it is only 
quite recently that systematic attempts 
have been made to explore the effects on 
feeding behavior of other types of brain 
injury. This review will begin, there- 
fore, with a description of the effects of 
hypothalamic injury, and the effects of 
extrahypothalamic lesions will be de- 
scribed in a separate section, 
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EFFECTS or HYPOTHALAMIC LESIONS 
ON HUNGER 


Hyperphagia 


The hypothalamic lesions made by 
Hetherington and Ranson (1940) in 
their pioneer experiment were large, but 
in a further series of experiments, Heth- 
erington showed that obesity could be 
produced by destroying only a small 
hypothalamic area—the ventromedial 
nucleus and the area immediately ad- 
jacent to it (Hetherington, 1941, 1944; 
Hetherington & Ranson, 1942b). This 
finding was quickly confirmed by other 
investigators, and it was shown to be 
true for several animal species—rats 
(Tepperman, Brobeck, & Long, 1941), 
cats (Wheatley, 1944), and monkeys 
(Brooks, Lambert, & Bard, 1942). In 
mice, obesity following ventromedial hy- 
pothalamic lesions has been demon- 
strated by two different methods—elec- 
trolytic lesions (Mayer, French, Zighera, 
& Barrnett, 1955) and goldthioglucose 
poisoning (Marshall, Barrnett, & Mayer, 
1955). 

It was fairly soon established that the 
cause of hypothalamic obesity was over- 
eating or hyperphagia. Hetherington 
(1941) was inclined at first to attribute 
the weight gain to underactivity and 
metabolic disturbances, but paired feed- 
ing experiments with normal controls as 
well as measurements of food intake, 
metabolic rate, and activity level showed 
that the changes in activity and metabo- 
lism were minor by comparison with the 
enormous increase in food intake (Bro- 
beck, 1946; Brobeck, Tepperman, & 
Long, 1943; Brooks, 1946; Brooks, Ma- 
rine, & Lambert, 1946; Tepperman et 
al., 1941). With electrolytic lesions, the 
hyperphagia is apparent as soon as the 
animal comes out of the anesthetic (Bro- 
beck et al., 1943; Brooks, Lockwood, & 
Wiggins, 1946), and it continues for a 
period of several months (dynamic 


phase) during which the animal may 
double its original weight. Eventually, 
a plateau is reached (static phase) and 
the food consumption tapers off to nor- 
mal. Obese animals can be dieted down 
to a normal weight, but they again be- 
come obese when fed ad lib. (Brobeck 
et al., 1943; Brooks & Lambert, 1946; 
Kennedy, 1950). During the dynamic 
phase, resection of most of the stomach 
does not appreciably alter the hyper- 
phagia (Brooks, Lockwood, & Wiggins, 
1946). Additional lesions of the lateral 
hypothalamus, however, not only elimi- 
nate the hyperphagia but also cause the 
animal to stop eating altogether (Anand 
& Brobeck, 1951b). 

Once it had been established that ani- 
mals with ventromedial hypothalamic 
lesions become obese primarily because 
they overeat, it became important to 
specify the cause of the overeating, and 
this proved to be more difficult. Brobeck 
(1946) has suggested that these lesions 
disturb some hypothalamic mechanism 
which normally maintains a balance be- 
tween energy need and food intake, but 
this is not so much an explanation as a 
description of a generally recognized phe- 
nomenon—hyperphagic animals are de- 
ficient in the ability to adjust to changed 
caloric requirements. Thus, although 
they are less active than normal animals 
(Hetherington, 1941; Hetherington & 
Ranson, 1942c), hyperphagic animals 
that are in the dynamic phase consume 
about twice as much food as normals 
(Tepperman et al., 1941). Also, hyper- 
phagic animals that are in the static 
phase (when food consumption is ap- 
proximately normal), if fed a diet adul- 
terated with some inert substance such 
as kaolin or cellulose (Kennedy, 1950; 
Larsson & Ström, 1957; Teitelbaum, 
1955) or subjected to low temperatures 
(Fregly, Marshall, & Mayer, 1957), do 
not compensate for the decreased calories 
or the increased need for heat produc- 
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tion by increasing their food intake as 
normals do under these conditions. It 
should be noted, however, that the fail- 
ure of obese hypothalamic hyperphagic 
animals to eat more of a cellulose adul- 
terated diet can also be explained on the 
grounds that hyperphagic animals are 
more responsive than normals to altera- 
tions in the taste and texture of food; 
this will be discussed in more detail later, 
The overeating which follows ventro- 
medial hypothalamic lesions has also 
been attributed to a change in emotion- 
ality. Wheatley (1944) pointed out that 
lesions in this area make animals savage; 
this prompted Brooks, Lockwood, and 
Wiggins (1946) to suggest that the vora- 
ciousness of hypothalamic hyperphagic 
animals was actually a substitute for at- 
tack, but the explanation does not seem 
to have been generally accepted. It does 
not account for the overeating which re- 
sults from lesions in other areas such as 
the caudal hypothalamus and caudal 
thalamus where taming effects have been 
noted (Brobeck, 1946: Hetherington & 
Ranson, 1942c; Patton, Ruch, & Walker, 
1944), and this objection was cited by 
Brooks (1947) a year later when he re- 
pudiated his own earlier position. 
. Perhaps the most popular explanation 
is that hyperphagia is a “release phe- 
nomenon.” Hyperphagic animals overeat 
not because they are “hungrier” but be- 
cause, lacking a “satiety” mechanism, 
they are unable to stop eating once they 
have started (Anand & Brobeck, 1951a; 
Kennedy, 1950; Miller, Bailey, & Ste- 
venson, 1950; Teitelbaum, 1955). How 
this satiety mechanism operates in nor- 
mal animals is not generally agreed upon, 
According to Brobeck (1957), the inges- 
tion of food causes an almost immediate 
increase in heat production which, in 
turn, causes thermal receptors in the 
hypothalamus to inhibit eating. Other 
investigators, however, have suggested 
that the control of food intake depends 
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on chemoreceptors situated in the hypo- 
thalamus which are sensitive to changes 
in blood composition (Kennedy, 1953; 
Mayer, Vitale, & Bates, 1951). What- 
ever the mechanism involved, the evi- 
dence for a satiety center derives from 
a number of sources, and these will be 
considered separately. 

One line of evidence concerns the dif- 
ference between the eating patterns of 
normal and hypothalamic hyperphagic 
animals. Increased consumption by hy- 
perphagic animals is reflected mainly in 
meals which are larger than normal 
(Brooks, Lockwood, & Wiggins, 1946; 
Larsson & Strém, 1957; Teitelbaum & 
Campbell, 1958). On a solid or liquid 
diet, hyperphagic animals, unlike nor- 
mals, consume as much during the day 
as at night (Anliker & Mayer, 1956; 
Brooks, 1946; Teitelbaum & Campbell, 
1958), and, on a diet of wet mash, hy- 
perphagic rats eat most just after the 
food has been changed regardless of the 
hour (Brobeck et al., 1943; Brooks, 
Lockwood, & Wiggins, 1946). 

A second line of evidence is related to 
the dissociation between two measures of 
food drive in hypothalamic hyperphagic 
animals. These animals do not work as 
hard as normals on food motivated tasks, 
even when the deprivation is extended 
up to 96 hours, but they overeat when 
fed ad lib. (Ingram, 1958; Miller et al., 
1950; Teitelbaum, 1957), and these 
findings have been interpreted to mean 
that ventromedial hypothalamic lesions 
lower food drive but impair the mecha- 
nism which inhibits eating. It is inter- 
esting to note that the degree of the 
performance deficit is a function of the 
amount of work involved. In an experi- 
ment in which food was obtained by 
pushing the lid off a box, hyperphagic 
animals ate more than normals when the 
lid was unweighted but less than normals 
when the lid was weighted (Miller et al., 
1950). In another experiment in which 
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the total daily food supply was obtained 
by bar pressing during a 12-hour period 
in a Skinner box, hyperphagic rats 
pressed more and obtained more food 
than normals when the ratio of nonrein- 
forced to reinforced responses was low, 
but they pressed less and obtained less 
food when the ratio was high (Teitel- 
baum, 1957). 

According to a third line of reasoning, 
the “release” brought about by ventro- 
medial hypothalamic lesions consists of 
an overreactivity to stimuli associated 
with feeding. Thus, stomach preloads 
(Smith, Salisbury, & Weinberg, 1961) 
and appetite depressant drugs (Epstein, 
1959; Reynolds, 1959) significantly 
lower food intake in both hyperphagic 
and normal rats, but the effects of such 
procedures are more marked in hyper- 
phagic than in normal animals. Hyper- 
phagic animals also overreact to the 
quality of the diet offered them. Hether- 
ington and Ranson (1942c) noted that 
hyperphagic animals gained more on a 
soft diet than on pellets; Miller et al. 
(1950) observed that a high fat syn- 
thetic diet significantly increased the 
food intake of operate rats that had not 
appeared to be hyperphagic on a diet of 
dry chow; and Teitelbaum (1955) re- 
ported that the addition of dextrose to a 
powdered diet significantly increased the 
food intake of obese hyperphagic animals. 
On the negative side, the addition of 
quinine to the diet in an amount which 
did not disturb normal animals resulted 
ina significant decrease in food intake 
by obese hyperphagic rats (Miller et al., 
1950; Teitelbaum, 1955). The rejection 
by obese hyperphagic animals of diets 
adulterated with cellulose or kaolin can 
also be explained on this basis, as noted 
earlier. 

„One qualifying point should be noted 
with respect to the extreme sensitivity 
of hyperphagic animals to the quality of 
the diet—animals that have not yet be- 


come obese are less responsive than obese 
animals to the taste and texture of food. 
Kennedy (1950) found that the addition 
of kaolin to the diet was readily compen- 
sated for by recent operates but not by 
rats that had already become obese; and 
Teitelbaum (1955) found that, whereas 
both groups overreacted to cellulose 
adulteration, the decrease in food intake 
was more extreme in the obese group. 
Similarly, the addition of quinine to the 
diet in an amount which caused the 
obese group to limit its intake severely 
had as little effect on the nonobese as 
on the normal control group (Teitel- 
baum, 1955). Thus, the hyperreactivity 
of the hyperphagic animal to the quality 
of the diet is in some way related to the 
state of the animal’s fat deposits. 


A phagia 


We turn now to a group of experi- 
ments in which hypothalamic lesions 
produced a totally different effect on 
eating. Injury to the lateral hypothala- 
mus at the same rostrocaudal level as the 
ventromedial nucleus has been found to 
result in a condition known as “apha- 
gia,” which is characterized by refusal to 
eat and loss of weight. In many cases, 
death by starvation occurs. The change 
in eating behavior does not appear to be 
attributable to a decrease in activity 
(Anand & Brobeck, 1951a). Aphagia, 
like hyperphagia, has been demonstrated 
in a variety of animal species—rats and 
cats (Anand & Brobeck, 1951a, 1951b; 
Teitelbaum & Stellar, 1954), monkeys 
(Anand, Dua, & Shoenberg, 1955), and 
chickens (Feldman, Larsson, Dimick, & 
Lepkovsky, 1957). Within the lateral 
hypothalamus, little damage is required 
to produce aphagia, and it is sometimes 
sufficient merely to lower an electrode 
without turning on the current (Mor- 
gane, 1961c; Morrison & Mayer, 1957a, 


1957b). 


When hypothalamic aphagia was first 
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described, the failure to eat spontane- 
ously appeared to be permanent and it 
was thought to result from destruction 
of an extremely limited area (Anand & 
Brobeck, 1951a, 1951b). However, re- 
cent work has shown that neither of 
these original observations was entirely 
accurate. Anand and Brobeck (1951b) 
found the effective area within the lateral 
hypothalamus to be so circumscribed 
that deviations by 1 mm. or less in any 
direction from their original electrode 
placement produced only a temporary 
depression in food intake which did not 
lead to death. Morgane (1961d) has 
verified their findings with respect to 
lesions placed anterior or posterior to 
the tuberal level, but he finds that, at 
the tuberal level, the effective region is 
more extensive, both dorsally and later- 
ally, than was originally supposed. Mor- 
gane (1961a) finds, further, that aphagia 
also results when the lesions are at the 
same level but are outside the hypothala- 
mus and are restricted to the globus pal- 
lidus. Current research has also modified 
earlier notions concerning the perma- 
nence of the effect. Although lateral hy- 
pothalamic lesions can lead to death by 
starvation, recovery is possible, provided 
the animals are tube fed during the pe- 
riod (which may be several months) 
when they refuse to eat spontaneously 
(Morrison & Mayer, 1957a; Teitelbaum 
& Stellar, 1954). Teitelbaum and his co- 
workers have studied the recovery pat- 
tern in a series of experiments (Epstein 
& Teitelbaum, 1960; Teitelbaum & Ep- 
stein, 1962; Teitelbaum & Stellar, 1954; 
Williams & Teitelbaum, 1959) and they 
note that recovery involves four separate 
stages. In the first, the animal does not 
eat or drink; in the second, it eats small 
quantities of preferred foods but does 
not drink; in the third, it eats normally 
but still refuses water ; and in the fourth, 


it eats and drinks normally. 


Dual-Center Theory 


The most widely accepted explanation 
of the drastic and apparently opposite 
effects of medial and lateral hypotha- 
lamic lesions is the dual-center theory 
proposed by Anand and Brobeck (1951a, 
1951b; see also Anand et al., 1955). 
According to the theory (which is an 
expansion of the concept of the “satiety 
mechanism” previously outlined), the 
lateral hypothalamus is the site of an 
“excitatory” or “feeding” center which 
initiates the feeding response when the 
animal is stimulated by the sight or smell 
of food, and the ventromedial nucleus is 
the site of a corresponding “inhibitory” 
or “satiety” center, which stops the re- 
sponse when the animai has satisfied its 
physiological need for food. Although 
the theory cannot account satisfactorily 
for all the data, it does account for a 
great deal. 

Perhaps the strongest support for the 
dual-center theory comes from the find- 
ing that ablation and electrical stimula- 
tion of the ventromedial nucleus and of 
the lateral hypothalamic area have re- 
ciprocal effects, Thus, removal of the in- 
hibitory center results in an increase in 
food consumption (Hetherington & Ran- 
son, 1940), but stimulation of the same 
area results in a decrease in food con- 
sumption (Olds, 1958; Smith, 1956) and 
in the inhibition of a previously learned 
instrumental response motivated by food 
reward (Wyrwicka & Dobrzecka, 1960). 
Similarly, ablation of the excitatory cen- 
ter causes cessation of eating (Anand & 
Brobeck, 1951a, 1951b), but its stimula- 
tion causes increased food intake (Del- 
gado & Anand, 1953; Larsson, 1954; 
Olds, 1958; Smith, 1956). Alterations 
in food intake, along the lines predicted 
by the theory, have also been produced 
by injection of drugs directly into the 
hypothalamus. This method is of par- 
ticular interest since it produces more 
specific effects than electrical stimula- 
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tion; it is possible, for example, by in- 
jecting the appropriate drug at the same 
hypothalamic site, to alter not only the 
type of response (i.e., eating versus 
drinking; Grossman, 1960) but also the 
direction of the response (i.e., increase 
versus decrease in food intake; Epstein, 
1960). 

Additional support for the theory 
comes from the finding that the eating 
elicited by stimulation of the lateral hy- 
pothalamus is motivated and not merely 
reflexive. Satiated animals, whether 
stimulated electrically or chemically, will 
perform a variety of instrumental re- 
sponses to get food. Coons and Miller 
(cited by Miller, 1957) trained rats to 
push back a hinged door; Grossman 
(cited by Miller, 1960) trained rats to 
bar press; Morgane (1961b) trained rats 
to cross an electrified grid; and Wyr- 
wicka, Dobrzecka, and Tarnecki (1959) 
trained goats to place the foreleg on a 
food tray. Wyrwicka and Dobrzecka 
(1960) further showed that, once this 
conditioned response had been elicited 
in satiated goats by stimulation of the 
lateral hypothalamus, it could then be 
inhibited by subsequent stimulation of 
the ventromedial nucleus. 

Both Anand (Sharma, Anand, Dua, & 
Singh, 1961) and Brobeck (Brobeck, 
Larsson, & Reyes, 1956) have been 
associated with attempts to demonstrate 
the existence of a dual hypothalamic 
feeding mechanism in yet another way, 
by recording electrical activity in the 
medial and lateral hypothalamus under 
varying conditions of hunger and satiety, 
but these studies have not furnished un- 
ambiguous support for the theory. On 
the basis of their results, Brobeck and 
his co-workers concluded that ampheta- 
mine, a known appetite depressant, acts 
upon the medial hypothalamus. How- 
ever, subsequent attempts by bath Ep- 
stein (1959) and Reynolds (1959) to 


demonstrate this behaviorally failed. 


Both investigators found that the drug 
suppressed eating in hypothalamic hy- 
perphagic rats as well as in normals. 
The evidence furnished by Sharma et al. 
(1961) appears to be more convincing, 
but it should be noted that, in the latter 
experiment, as well as in the one by Bro- 
beck and his group, the data consist of 
limited selections from raw electroen- 
cephalogram records; in the absence of 
quantification, what constitutes a “sig- 
nificant change” in electrical activity 
may be a matter of dispute. 

Perhaps the major objection to the 
dual-center theory is that it does not en- 
tirely account for the complexity of the 
changes that follow ventromedial and 
lateral hypothalamic lesions. Consider, 
for example, the problem of what hap- 
pens when an animal “recovers” from 
hyperphagia. The lateral hypothalamus 
is believed to be the more important of 
the two centers, and it is usually as- 
sumed that, when the hyperphagic ani- 
mal enters the static phase, inhibition of 
feeding is mediated by undamaged tissue 
in this area (Brobeck, 1957). That the 
new control over food intake is not 
simply a matter of vicarious function 
following a more or less fixed recovery 
period is shown by a number of facts— 
if animals are prevented from overeating 
for a period of several months following 
operation, they become obese when 
placed on an unrestricted feeding sched- 
ule (Brobeck et al., 1943); hyperphagia 
can be reinstated in animals that have 
presumably recovered by dieting the ani- 
mals down to a normal weight level 
(Brobeck et al., 1943); and, when ad 
lib. feeding is resumed, the hyperphagic 
tapers off at a higher weight level than 
previously (Brooks & Lambert, 1946). 
That body weight somehow plays @ role 
in the regulation of food intake in hypet- 
phagics has already been noted, but how 
it interacts with a central inhibitory 
mechanism is not clear. It should also 
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be noted that Anand and Brobeck’s 
(1951a, 1951b) reason for assuming the 
lateral center to be more basic was that 
aphagia resulted when both medial and 
lateral destruction occurred in the same 
animal, and this was true whether the 
medial area was destroyed first or the 
two areas were destroyed simultaneously. 
Yet, it has recently been shown that, 
when such animals recover from aphagia, 
they become obese (Teitelbaum & Ep- 
stein, 1962). What determines this se- 
quence of events is, as yet, unexplained. 
Another point to be considered is that 
the aphagia that occurs following lateral 
hypothalamic lesions is not an all-or- 
none phenomenon, as one would expect 
if the lesions simply removed an excita- 
tory center, Instead, whether recently 
operated animals starve to death or 
maintain their normal weight levels de- 
pends upon whether they are offered dry 
food or a liquid diet (Williams & Teitel- 
baum, 1959), Even after they have pre- 
sumably recovered, such animals will 
Sometimes refuse quinine adulterated 
foods at dosage levels accepted by nor- 
mal animals (Teitelbaum & Epstein, 
1962); the fact that the same behavior 
has been noted in recovered hyper- 
phagics (Teitelbaum, 1955) suggests 
that there is some underlying relation- 
ship between hyperphagia and aphagia. 
It is conceivable that they represent 
different degrees or different stages of 
the same disorder—an extreme sensitiv- 
ity to the taste and texture of food. 
Morgane (196la, 1961b, 196 1c, 
1961d) has criticized Anand and Bro. 
beck’s dual-center theory on other 
grounds. He suggests that the lateral 
hypothalamus contains not one feeding 
center but two—a midlateral “motiya- 
tional” system, dependent upon the in- 
tegrity of the medial forebrain bundle, 
and a more basic far-lateral “metabolic” 
system, dependent upon the integrity of 
pallidofugal fibers. The distinction is 
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based, in part, on:Morgane’s supposed 
demonstration of a nonreversible aphagia 
following far-lateral but not following 
midlateral lesions. Actually, as Teitel- 
baum and Epstein (1962) have shown, 
all animals with lateral hypothalamic 
lesions recover eventually. However, the 
recovery period is longer after far-lateral 
lesions, and some animals with such le- 
sions never drink water spontaneously. 
Morgane’s findings were apparently due 
to his failure to distinguish between two 
effects—loss of spontaneous eating and 
failure to eat because of lack of adequate 
hydration. The other basis on which 
Morgane claims to distinguish between 
motivational and metabolic centers is 
his own demonstration that far-lateral 
stimulation elicits both feeding and food- 
motivated behavior only as long as the 
medial forebrain bundle is intact. With 
the medial forebrain bundle sectioned, 
such stimulation elicits feeding but not 
food-motivated behavior. In these ex- 
periments, the area stimulated was ex- 
tremely close to the damaged tissue and 
the effect could have been due to this 
factor alone, a possibility for which 
Morgane has not furnished suitable con- 
trols. Despite the fact that there does 
not seem to be sufficient evidence for 
two feeding centers in the lateral hypo- 
thalamus, Morgane’s data are of con- 
siderable interest to psychologists, es- 
pecially since he has furnished a more 
careful anatomical analysis of the lateral 
hypothalamic syndrome than has been 
attempted to date. 

A final and less serious objection to 
the dual-center theory is that the only 
parts of the hypothalamus that are sup- 
posed to function in the regulation of 
food intake are the ventromedial and 
lateral parts at the tuberal level. Tt will 
be noted that the present discussion has 
been confined, so far, to experiments 
dealing with injuries or stimulation at 
this level, but there are a few experi- 
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ments implicating other parts of the hy- 
pothalamus in feeding behavior, and 
these will now be described briefly. 
Lesions placed rostral to the ventro- 
medial hypothalamic nucleus produce 
obesity in dogs (Biggart & Alexander, 
1939; Heinbecker & White, 1942; Hein- 
becker, White, & Rolf, 1944), although 
they have no effect on food intake in 
rats, cats, or monkeys (Anand & Bro- 
beck, 1951a; Anand et al., 1955; Heth- 
erington, 1944). With respect to lesions 
placed in the caudal hypothalamus, the 
evidence is confusing. There are several 
studies in which lesions in the region of 
the mammillary bodies produced apha- 
gia (Clark, Magoun, & Ranson, 1939; 
Ingram, Barris, & Ranson, 1936; Ran- 
son, 1939), On the other hand, Hether- 
ington and Ranson (1942b) produced 
obesity in rats by ablating a part of the 
lateral hypothalamus dorsolateral to the 
mammillary bodies, and they attributed 
their results to the destruction of fibers 
from the ventromedial nucleus which 
pass through this region on the way to 
the midbrain. In other studies, caudal 
hypothalamic lesions in rats and cats 
(Anand & Brobeck, 1951a) and in cats 
and monkeys (Anand et al., 1955) pro- 
duced no change in food intake. To con- 
fuse the issue further, in one study in 
which caudal hypothalamic lesions elimi- 
nated spontaneous eating, it was reported 
that the animals chewed and swallowed 
voraciously if food was forced into their 
mouths (Ingram et al., 1936). . 
Recent studies utilizing the electrical 
stimulation technique have also impli- 
cated parts of the hypothalamus not pre- 
viously believed to be involved in the 
regulation of food intake, and here the 
evidence is more consistent. Eating 1n 
satiated animals was induced by stimu- 
lating the preoptic region (Robinson & 
Mishkin, 1962) and the caudal hypo- 
thalamus in the vicinity of the mammil- 
lary bodies (Larsson, 1954; Maire, 
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1956; Ruch, Maire, & Patton, 1956). 
Also, cats stimulated in the dorsal hypo- 
thalamic nucleus or in the paraventricu- 
lar hypothalamic nucleus performed a 
learned response to obtain food (Gras- 
tyan, Lissdk, & Kékesi, 1956). Some of 
these data are inconclusive because no 
histological verification of placements 
has been presented. It is also possible 
that some of the results can be “ex- 
plained away” on the basis of current 
spread to the feeding or satiety centers. 
Nevertheless, until the issue is resolved 
by further research, the possibility that 
other parts of the hypothalamus func- 
tion in the regulation of food intake can- 
not be entirely ignored, 


EFFECTS OF EXTRAHYPOTHALAMIC 
LESIONS ON HUNGER 


In this section, only those instances of 
increased or decreased food intake fol- 
lowing brain lesions will be reported that 
bear some resemblance to hypothalamic 
hyperphagia or aphagia. Omitted are 
reports of increased food intake which 
occur in conjunction with increased 
energy requirements, as in the case of 
frontal lobe lesions (e.g., Richter & 
Hawkes, 1939), as well as reports of 
decreased food intake in which the be- 
havioral descriptions suggest that motor 
or intellectual deficits were responsible 
for the feeding change (e.g., Bailey & 
Davis, 1942; Kennard, 1955). It should 
be noted that a great deal of the mate- 
rial which follows was derived from ex- 
periments in which feeding changes were 
reported only incidentally; and, conse- 
quently, in many cases, systematic rec- 
ords of food intake and of weight gain 
were not reported. This admittedly lim- 
its the conclusions that can be drawn 
about the effects of lesions in any one 
area but, taken as a group, these experi- 
ments are important because they call 
attention to a topic which has not yet 
been adequately explored, the part 
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played by areas other than the hypo- 
thalamus in the regulation of food in- 
a caatins of the dorsomedial tha- 
lamic nucleus reportedly produces a syn- 
drome which is virtually identical to the 
one that follows ventromedial hypotha- 
lamic injury. Cats subjected to such an 
operation were found to be hyperphagic, 
irritable, and underactive. Also, as was 
the case with ventromedial hypothalamic 
animals, there seemed to be a relation 
between eating and emotionality. When 
the cats were subjected to a mild stress, 
such as having their tails pinched, they 
gulped food. 
It appeared as though the animals released 
their savage attacks upon the meat rather than 
turning to attack the offending stimulus as 
would be done, in most instances, by un- 


operated cats (Schreiner, Rioch, Pechtel, & 
Masserman, 1953, p. 239]. 


Posterior thalamic lesions extending into 
the rostral mesencephalon also increase 
eating, although, in this case, the feeding 
change is accompanied by tameness 
rather than by savageness. In the course 
of a series of experiments aimed at estab- 
lishing the arcuate nucleus as the tha- 
lamic relay for taste, Ruch and his co- 
workers noted incidentally that a few 
operate monkeys that showed only a 
mild and transient taste deficit tripled 
their caloric intake and ultimately be- 
came obese (Patton et al., 1944; Ruch, 
Blum, & Brobeck, 1941; Ruch, Patton, 
& Brobeck, 1942). Lesions confined to 
the rostral mesencephalon may also pro- 
duce obesity provided they are suffi- 
ciently extensive to interrupt most of 
the lateral sensory tracts (Sprague, 
Chambers, & Stellar, 1961). Small le- 
sions in the same area have a somewhat 
different effect—ad lib. food intake re- 
mains unchanged but motivation to work 
for food is substantially decreased (Ehr- 


lich, 1960). 
i The notion that temporal lobe struc- 


tures might be involved in the regulation 
of food intake has received a fair amount 
of attention in recent years. The idea 
is an old one, however, and dates back at 
least as far as 1888, when Brown and 
Schäfer noted that one monkey with a 
complete bitemporal lobectomy was a 
hyperphagic and indiscriminate feeder. 
Over a half century later, Terzian and 
Ore (1955) observed the same effect in 
the case of a human subjected to a simi- 
lar operation: 

Some days after the operation the patient 
demonstrated an insatiable appetite and ate at 
least as much as four normal persons. He pre- 
pared himself for meals as for a ceremony. 
He would personally go to the kitchen and 
insistently ask for food. at any hour. He 
would look for a secluded corner far from 
anyone, eat everything voraciously without 
preference for any certain food, lick the dish 
incessantly, and after 15 minutes asked [sic] 
for more food [p. 375]. 


Hyperphagia results also from more 
restricted damage to the temporal lobe. 
This was first shown by Brown and 
Schäfer (1888), who removed only the 
superior temporal gyri in one monkey. 
Recently, the effect has also been pro- 
duced by ablation of the entorhinal area. 
Phalangers, which are normally noc- 
turnal feeders, were found to eat raven- 
ously at any time of day after operation; 
the change was only temporary, however, 
and disappeared entirely within about 
2 weeks (Adey, Merrillees, & Sunder- 
land, 1956). 

The temporal lobe structure most 
often implicated in feeding changes is 
the amygdala and, with some exceptions 
(Anand & Brobeck, 1952; Kling & 
Schwartz, 1961; Schreiner & Kling, 
1953), the effect commonly produced is 
hyperphagia. This change has been ob- 
served not only following operations in 
which the amygdala was damaged along 
with other temporal lobe structures but 
also following operations restricted to 
the amygdala, and it has been reported 
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in a variety of species—monkeys 
(Schwartzbaum, 1961), cats (Green, 
Clemente, & de Groot, 1957; Morgane 
& Kosman, 1957, 1959; Wood, 1958), a 
baboon (Adey, 1958), and dogs (Fuller, 
Rosvold, & Pribram, 1957). In the latter 
case, however, the change was tempo- 
rary. The rat is the only common labo- 
ratory animal in which hyperphagia has 
not been reported following amygdalec- 
tomy; instead, the operation seems to 
produce an aphagia of variable duration 
(Anand & Brobeck, 1952; Kling & 
Schwartz, 1961). 

Presently available evidence suggests 
that amygdaloid hyperphagia is not as 
striking a phenomenon as hypothalamic 
hyperphagia. In the only studies to 
date in which long-term records were 
kept of preoperative and postoperative 
food intake and weight gain, Morgane 
and Kosman (1957, 1959) found that, 
while amygdalectomized cats ate sig- 
nificantly more food and gained signifi- 
cantly more weight than normal controls, 
the increases in weight did not go beyond 
50% of the preoperative levels; hy- 
pothalamic hyperphagics, on the other 
hand, frequently double their initial 
weight (Tepperman et al., 1941). The 
dissociation between consummatory and 
motivated behavior that occurs in ani- 
mals made hyperphagic by ventromedial 
hypothalamic lesions has been shown 
to occur also in animals made hyper- 
Dhagic by amygdaloid lesions. Schwartz- 
baum (1960, 1961) has reported that 
amygdalectomized monkeys that were 
hyperphagic on an ad lib. diet were 
relatively insensitive to increasing 
amounts of food deprivation and to 
shifts in the size of the food reward in 
a situation in which they were required 
to bar press for food. 

Further evidence concerning the role 
played by extrahypothalamic structures 
in the control of feeding comes from 
some recent studies utilizing electrical 


stimulation. It is interesting to note that 
eating was elicited by neocortical stim- 
ulation in only one instance (Delgado, 
1952); all the other cases in which feed- 
ing behavior was altered involved stim- 
ulation of structures belonging to, or as- 
sociated with, the limbic system. Thus, 
eating in satiated animals has been 
elicited by stimulation of the olfactory 
area, anterior thalamus, septum, cingu- 
late gyrus, hippocampal commissure, 

amygdala, and midbrain (Grimm, 1960; 

Miller, 1960; Robinson & Mishkin, 

1962; Smith, McFarland, & Teitelbaum, 

1961; Wyrwicka & Doty, 1962). Stim- 

ulation of different parts of some of these 

structures has also been observed to 

inhibit food intake (Fonberg & Delgado, 

1961; Robinson & Mishkin, 1962). 


FEEDING SYSTEM 


When Bailey and Bremer (1921) 
described the effects of hypothalamic 
lesions on food intake, little was known 
about this structure, and they were care- 
ful to specify that it was a “region 
known to anatomists as the hypothal- 
amus [p. 774].” They enthusiastically 
predicted that “the time is not far 
distant when the neuropathologist will 
no more think of omitting to examine 
the hypothalamus than he would the 
motor cortex [p. $02].” Their predic- 
tion was accurate, and currently the 
pendulum has swung 50 far the other 
way that the hypothalamus has been 
credited with containing excitatory and 
inhibitory centers not only for feeding 
but for each type of motivation (Stellar, 
1954). 

From the studies reviewed in the pre- 
ceding section, it is apparent that 
changes in food intake occur as & result 
of ablating or stimulating many parts 
of the brain, and it would seem, there- 
fore, that the hypothalamus is part of a 
larger “feeding system” that also includes 
parts of the thalamus, temporal lobe, 
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frontal lobe, and midbrain. (Of course, 
it is also possible that more than one 
system is involved, with each one func- 
tioning more or less independently. 
However, since current research has not 
progressed far enough to make this a 
serious possibility, the more parsimo- 
nious explanation, in terms of a single 
system, is preferable.) Whether one 
takes the further step of viewing the 
hypothalamus as a unique part of this 
system is another matter. In their initial 
formulation of the dual-center theory, 
Anand and Brobeck paid scant atten- 
tion to the possibility that structures 
other than the hypothalamus might be 
involved in the regulation of food intake, 
but in a recent article, Anand (1959) 
notes what he terms “cerebral influ- 
ences” on hunger. However, he suggests 
that these are secondary and are medi- 
ated by way of the hypothalamus, which 
is still considered to be the site of pri- 
mary hunger receptors. A more moder- 
ate view (and one that seems to accord 
better with the known facts) is that the 
Importance of the hypothalamus lies in 
the fact that it is a point of convergence 
for a great many fiber systems, all of 
which act on lower reflex arcs control- 
ling feeding (Morgane, 1961a). One 
would, therefore, expect the effects of 
hypothalamic lesions to be more severe 
simply because there would be fewer 
alternate pathways available following 
hypothalamic than following other types 
of brain injury. 
Neural control of food intake remains 
a challenging area for investigation, Re- 
search to date has progressed only to 
the stage of mapping crucial areas of 
the brain, but a great deal remains to 
be learned especially with regard to such 
problems as the following: (a) The rela- 
tionship between hyperphagia and 
aphagia. In the case of hypothalamic 
lesions, it was noted that both syndromes 
involved alterations in taste but whether 


ANNETTE EHRLICH 


this is also true when food intake is 
altered by extrahypothalamic lesions is 
not known. (b) The relationship be- 
tween food and water regulation. The 
extensive (and often contradictory) 
literature on the subject of neural con- 
trol of water intake will not be reviewed 
here, but it should be noted that, in 
many cases, alterations in food intake 
or in food motivation are accompanied 
by similar changes in water intake or 
in water motivation (Delgado, 1952; 
Ehrlich, 1960; Morrison & Mayer, 
1957a, 1957b; Paget, 1897; Robinson & 
Mishkin, 1962; Ruch et al, 1956; 
Schreiner & Kling, 1953; Teitelbaum & 
Epstein, 1962; Teitelbaum & Stellar, 
1954; Williams & Teitelbaum, 1959), 
(c) The discrepancy between consum- 
matory and motivated behavior that 
sometimes occurs as a result of brain 
injury. The phenomenon has been noted 
following hypothalamic (Miller et al., 
1950; Teitelbaum, 1957), amygdaloid 
(Schwartzbaum, 1960, 1961), and mid- 
brain (Ehrlich, 1960) lesions, In all of 
the foregoing experiments, the possibility 
that the motivational change was not 
related to feeding as such but to some 
more general change was not adequately 
explored. Yet, some preliminary data 
reported by Ingram (1958), showing 
that avoidance as well as food motiva- 
tion is impaired in hypothalamic hy- 
perphagic cats, makes this a reasonable 
Possibility. (d) The relation between 
hypothalamic and extrahypothalamic 
structures involved in the regulation of 
food intake. Multiple lesion studies or 
combined lesion and stimulation tech- 
niques should help to answer the ques- 
tion of whether the various alternate 
pathways function independently. 
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EXPERIMENTAL INVESTIGATIONS OF THE 
GENESIS OF VISUAL SPACE PERCEPTION? 


WILLIAM EPSTEIN 
University of Kansas 


The methodology and findings of 3 categories of experimental studies 
are reviewed and evaluated: (a) studies of organisms that have been 
deprived of visual stimulation until the time of testing, (b) studies of 
newborn organisms that are tested soon after birth, and (c) studies in 
which E controls the visual stimulation to which O is exposed from 
birth to the time of testing. The evidence provided by these experiments 
suggests that conclusions about the genetic basis of space perception will 
differ depending on the stimulus correlates under consideration. 


This article is concerned with the 
genesis of space perception. It is neces- 
sary at the outset to distinguish the 
question under consideration from two 
other issues with which it is often con- 
fused. The first concerns the role of past 
experience in determining the current 
perceptual event. The second is related 
to the influence of specific practice on 
perceptual discrimination. Arguments 
intended to bear on the question of gene- 
sis are often derived from experiments 
dealing with the latter two issues. For 
example, it is asserted frequently that if 
perceptual judgments can be modified 
by certain types of previous experience, 
then experience must have been required 
for their original emergence. This argu- 
ment has persisted since Helmholtz who 
wrote in 1866 that “whatever, therefore, 
can be overcome by factors of experi- 
ence, we must consider as being itself 
the product of experience and training 
(Helmholtz, 1925, p. 13].” Despite the 
long history of this assumption it must 
be clear that evidence of the malleability 
of perception cannot entail any con- 
clusion about the origin of the experi- 
ence. Thus, one would hardly wish to 
conclude from the observation of the 
Phenomenon of “memory color” (Dunc- 
ker, 1939) that color perception is 
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learned. Pastore (1960) has called at- 
tention to a parallel example in the field 
of genetics where “evidence of environ- 
mental modification of a trait does not 
preclude its genetic determination 
[p. 94].” This argument is mentioned at 
this point only to make clear that the 
issue under consideration is not whether 
space perception can be modified by ex- 
perience. 

For this reason many investigations 
which might otherwise be included, for 
example, studies of adaptation to spatial 
displacement, were omitted (I. Kohler, 
1951, 1962). This rule of omission is 
followed regarding the adaptation 
studies despite the fact that Kohler and 
others seem to feel that these experi- 
ments provide data relevant to a deter- 
mination of the origins of space percep- 
tion. Kohler has maintained that the 
conditions of displaced vision produced 
by the prisms reinstate in the adult an 
earlier state of perceptual development 
comparable to that of the infant. And 
that the processes that lead to adapta- 
tion to the experimentally induced 
displacement are reiterations of the proc- 
esses of original perceptual develop- 
ment. A thorough review of the adapta- 
tion studies conducted by Stratton, 
Snyder and Pronko, Kobler, and othe 
may be found in Smith and Smith (1962, 
Ch. 4-6). 

The investigation 
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sufficient conditions for the initial occur- 
rence of a response indicating that an 
organism is capable of spatial discrimi- 
nation. Can these conditions be specified 
entirely in terms of the organism’s native 
or congenital constitution or must a 
complete statement of these conditions 
include reference to processes of early 
learning? 


THREE METHODS or INVESTIGATION 


Until recently many American psy- 
chologists have been content to relegate 
the genetic question to the realm of 
philosophical inquiry. However, insofar 
as the question is concerned with the 
genesis of space perception and not with 
its epistemologica] status, it can only be 
decided empirically and is not properly 
a matter for philosophy. Fortunately, 
not all Psychologists have abdicated 
their prerogatives of investigation and 
the last decade has witnessed a modest 


The experimental investigations can be 
classified usefull 


il f a very early, 
occasion of stimulation their perception 


of space has been examined, When ani- 
mals have been studied they have been 
reared in total darkness from birth to 
the time of testing. When human sub- 
jects have been Studied, these have been 
(ideally) persons with congenital cata- 
racts whose vision has been restored 
surgically. 

The second category includes the in- 
vestigations of naive, newborn organisms 
tested soon after birth. Animals are 
better suited to this approach than 
humans since human infants are capable 
of very few reliable overt behavioral 
responses soon after birth. Conse- 
quently, human infants have usually 
been studied at a later age (Fantz, 1961; 
Gibson & Walk, 1960). However, there 


WILLIAM EPSTEIN 


has been one study by Wertheimer 
(1961) of the auditory and visual space 
of a human neonate during the first 10 
minutes of life, 

The third category of experiments 
utilizes an entirely different approach. 
In these studies the experimenter con- 
trols the characteristics of the visual 
stimulation to which the animal is ex- 
posed from birth to the time of testing. 
The aim is to determine whether ani- 
mals that have been subjected to dif- 
ferent conditions of stimulation will 
respond differently when confronted with 
a relevant test discrimination. The 
rationale of this approach will be elabo- 
rated when Hess’ (1950, 1956b) studies 
are considered. 


STUDIES or VISUALLY DEPRIVED 
OBSERVERS 


Studies of Animals 


The earliest systematic study was 
performed by Lashley and Russell 
(1934). The object of the study was to 
determine whether rats reared in dark- 
ness from birth to 100 days of age would 
perceive variations of distance. The rats 
were induced to jump from one platform 
to a second platform whose distance 
was varied from 20 centimeters to 40 
centimeters. An apparatus invented by 
Russell (1932) in order to study depth 
discrimination with normally reared rats 
was used. This apparatus allowed the 
experimenter to measure the force 
exerted by a rat in attempting to jump 
the given distance, Variations in force 
were interpreted as signifying corre- 
sponding variations in perceived dis- 
tance. The experiment was conducted 
during a period of 3 days with the rats 
returned to the dark cage at the con- 
clusion of each day’s testing. On the first 
day the animal was removed from the 
dark cage and given five trials in step- 
ping across a gap of 5 centimeters from 
the starting platform to the landing 
platform. Immediately following this 
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Practice, the landing platform was 
moved to 20 centimeters for five trials. 
On the second day the animal began 
with five trials at 20 centimeters. This 
was followed immediately by a sequence 
of nine jumps in this order: three jumps 
at 40 centimeters, three jumps at 20 
centimeters, three jumps at 40 centi- 
meters. On the third day the rat’s dif- 
ference-threshold for distance was deter- 
mined. For this purpose, each rat was 
given one trial at each of nine distances 
ranging from 24 to 40 centimeters in 2- 
centimeter intervals. The distances were 
presented in random order. 

The results showed that upon first 
exposure the rats adjusted the force of 
their jumps to the variations of distance. 
Thus a rat that overestimated the 20- 
centimeter distance on the first test trial 
would consistently reduce the force of 
his jump during the next nine trials. 
However, the first time the 40-centi- 
meter distance was presented (Trial 6 
on the second day), the rat immediately 
reversed the trend and doubled the force. 
This occurred with every subsequent 
change of distance. On the tests for dif- 
ference-thresholds the dark-reared rats 
were almost as accurate as light-reared 
rats that had been tested earlier by 
Russell (1932) in the same situation. 
Lashley and Russell (1934) concluded 
that the visual perception of distance and 
gradation of force in jumping to compensate 
for distance are not acquired by learning, but 
are the product of some innately organized 
Neural mechanism [p. 143]. 


Several doubts may be raised concern- 
ing the Lashley-Russell study. Greenhut 
and Young (1953) were unable to repeat 
Russell’s (1932) original findings with 
100-day-old and albino rats that had 
been reared normally. When they pre- 
sented the initial test distances 1n 
random order they failed to find any 
correlation between the force of the jump 
and distance, However, it should be ob- 
Served that Greenhut and Young used 


electric shock in order to motivate the 
animals to jump. Despite some evidence 
to the contrary (Greenhut & Young, 
1953, pp. 174-175), it is possible that 
this aversive stimulus disrupted the ani- 
mal’s spatiomotor coordination in many 
instances. In fact, Greenhut and Young 
(1953) reported that “the animals were 
emotionally disturbed throughout the ex- 
periment [p. 160].” 

In addition to the results of Greenhut 
and Young, the Lashley-Russell experi- 
ment has been criticized on the grounds 
that opportunities for learning were not 
completely eliminated. In particular it 
has been noted that on every second day 
the dark box was opened in dim light 
while food was supplied. Although each 
interval of light stimulation did not ex- 
ceed 10 seconds, it is possible that the 
cumulative exposure was sufficient to 
produce substantial learning. The five 
practice trials which preceded the first 
test jump may have provided an op- 
portunity for rapid learning. This argu- 
ment gains plausibility from the general 
observation that organisms which have 
reached an advanced maturational level 
require less practice to achieve a 
specified performance level than organ- 
isms that begin their practice at an 
earlier level of maturation (Munn, 
1955). 

Lashley and Russell (1934) recog- 
nized the problems discussed above and 
presented counterarguments that are 
persuasive. However, the only effective 
way to deal with these objections is to 
perform new experiments which are more 
successful in eliminating opportunities 
for learning. The main flaw in the Lash- 
ley-Russell study stemmed from the pan 
ture of the response that they studied. 
The jumping response is a complex 
motor skill which requires practice. Rats 
reared in the dark are deficient in this 
skill. Therefore, it was necessary to ana 
troduce the practice trials, thus provid- 
ing an opportunity for both visual an 
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motor learning. A response indicator 
which is part of the organism’s unlearned 
response repertoire would eliminate this 
necessity. Since the performance of the 
response per se would require no prac- 
tice, the subject could be tested on the 
very first occasion of his exposure to the 
stimulus without prior practice of the 
response. 
Kurke (1955), using a technique 
similar to that employed in the earlier 
studies by Spalding (1875) and Thorn- 
dike (1899), studied the depth percep- 
tion of chicks reared under different con- 
ditions of visual and motor experience. 
Kurke’s procedure was based on the 
observation that lone chicks will run to 
join their fellows. The chick was placed 
on a platform whose height could be 
adjusted. From the platform the subject 
could see a group of six chicks. The as- 
sumption was made that if the chicks 
perceived the height of the platform, 
then they would be reluctant to leave the 
platform before it was lowered to a safe 
level. The platform was set initially at 
a height of 21 inches, and if the chick 
did not jump within 30 seconds, the plat- 
form was lowered 3 inches. This was 
repeated until all subjects had jumped 
or the platform was on ground level. 
Dark-reared chicks were tested at 1,2, 
or 3 days of age. Their performance was 
compared with a group of 3-day-old con- 
trol chicks raised in normal lighting. The 
very young chicks (1 and 2 days) re- 
fused to leave the platform regardless 
of its height. The 3-day-old dark-reared 
chicks did not differ significantly from 
the light-reared control chicks. Nor did 
they differ significantly from 10-day-old 
light-reared chicks whose opportunities 
for jumping experience was limited, 
However, 10-day-old light-reared chicks 
which were subjected to enforced jump- 
ing activity differed from the other goups 
in that they jumped from the platform 
at a greater height. Kurke (1955) con- 


cluded: 


that the lack of visual experience does not 
appreciably affect the chick’s perception of 
depth at an early age. However, the further 
development of depth discrimination appar- 
ently depends in some manner on the integra- 
tion of experientially determined kinesthetic 
cues [p. 195]. 

Kurke’s experiment is not very con- 
vincing. As was the case with the 
Lashley-Russell study, the difficulty is 
due mainly to the response measure. The 
response of jumping is determined by 
other variables besides perceived height. 
Among these is the level of muscular 
coordination and jumping skill which 
has been achieved. This leads to ambi- 
guities in interpreting the results. Dif- 
ferences in performance may reflect dif- 
ferences in depth discrimination, but it 
could also be the case that subjects who 
are equally able to discriminate depth 
will vary in performance because of 
different levels of motor achievement. 

The resolution of this methodological 
problem has been accomplished in 
several studies of color vision (Hess, 
1956a), form discrimination (Fantz, 
1957), and recently in a study of depth 
discrimination by dark-hatched chicks 
(Fantz, 1958a). The chick’s discrimina- 
tion of depth or solidity was examined 
by observing the distribution of unrein- 
forced pecks in response to the simul- 
taneous presentation of a spherical and 
plane circular surface. The rationale 
underlying the test is that if the chick 
shows a significant preference for one of 
the stimuli, that is, pecks more fre- 
quently at one, then the chick must be 
making perceptual discriminations be- 
tween them. If the stimuli differ only on 
the relevant dimension, for example, 
solidity, then the subject must be able 
to discriminate along this dimension. It 
should be noted that, in addition to pro- 
viding a resolution of the methodological 
difficulties described above, the stimulus- 
preference method also reduces the 
likelihood of confusing two different 
achievements: (a) learning a discrimina- 
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tion between stimuli to some arbitrarily 
assigned experimental criterion, and (b) 
discriminating between stimuli. While 
the former achievement presupposes the 
latter, it cannot be utilized as an index 
of the latter achievement except with 
certain qualifications. Suppose the ex- 
perimenter confronts the subject with 
the task of learning a discriminatory 
positive response to a convexity as Op- 
posed to a concavity. As a measure of 
learning the experimenter uses the cri- 
terial level of nine correct responses in 
a block of 10 successive trials. This 
accomplishment presupposes that the 
subject can perceive the depth difference 
between the two stimuli, although he 
must learn that this difference is the 
criterial attribute. Therefore, it would 
be erroneous to conclude that a subject 
that fails to achieve the criterial level, 
for example, correctly selects the con- 
vexity in only 8 of 10 trials, cannot dis- 
criminate depth. It is equally plausible 
to infer that depth is discriminated, but 
has not been selected unequivocally by 
the subject as the basis for responding. 
This argument has been made earlier by 
Zuckerman and Rock (1957) in their 
analysis of the genesis of form percep- 


ulation, The chicks had no experience 
with food or water prio 
experiment. In the 

chicks were tested singly, 3 
second experiment they were tested in 
pairs. The test consisted of a 5-minute 
exposure to a pair of stimuli. The 
stimuli were two hemispheres, one pre- 
sented so that the curved surface faced 
the chicks and the other presented so 
that only a flat circle was visible. The 
stimuli were presented either under di- 
Tect lighting which produced strong 
shading or under diffuse lighting which 
teduced the brightness gradients greatly. 
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Under both conditions the curved sur- 
faces received significantly more pecks 
than the plane surfaces. Apparently 
these chicks, with no visual experience, 
discriminated between the solid and 
plane stimuli. Comparable findings were 
reported by Fantz (1957, Experiment 
5) in his studies of form preference in 
newly hatched chicks. While these new- 
born chicks seemed to make no use of 
light and shade distribution in discrimi- 
nating depth, Fantz reported that 
slightly older chicks that had visual ex- 
perience showed an enhanced preference 
for spherical surfaces when shading was 
present. This observation is consistent 
with Hess’ (1950) findings which will 
be described later. 

Walk and Gibson (1961) realized the 
same methodological objectives by using 
the visual cliff apparatus. This is a simu- 
lated cliff consisting of a board laid 
across a sheet of heavy glass which is 
supported at a desired height above the 
floor. On one side of the board a sheet 
of patterned material, for example, 
checkerboard pattern, is placed flush 
against the undersurface of the glass. On 
the other side the same patterned mate- 
rial is laid on the floor below the glass. 
This is the cliff side. If an organism 
which discriminates depth is placed on 
the center board it will avoid the cliff 
side and step on to the shallow side. If 
the organism cannot distinguish between 
the shallow side and the cliff side, then 
no preference should be observed. In 
these experiments, as was the case in 
Fantz’ study, the discriminatory Te- 
sponse required no pretraining in light, 
and the discrimination did not entail 
cognitive, problem solving elements. 
Rats were dark-reared and tested at the 
age of 90 days. These rats alee the 
same preference for the —_ easa 
group of 90-day-old gees rats. 

Walk and Gibson’s results for the 
dark-reared rats were confirmed in a 


modified experiment py Nealey and Ed- 
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wards (1960). Nealy and Edwards 
noted that the dark-reared rats in the 
Walk-Gibson experiment were subjected 
to a light-adaptation period of 20 min- 
utes prior to testing. This adaptation 
period may have provided the oppor- 
tunity for learning to occur. For this 
reason Nealey and Edwards repeated the 
Walk-Gibson study with the following 
modification. During the 20-minute pe- 
riod subjects were kept in a detention 
box which was so contrived as to permit 
light adaptation without pattern vision. 
These rats showed the same preference 
for the shallow side as rats that received 
the Walk and Gibson treatment. Nealey 
and Edwards also tested 24 enucleated 
rats in order to determine whether a 
preference could be obtained on other 
than visual bases, These rats did not 
show a reliable preference. 

In further experiments Walk and Gib- 
Son (1961) tested dark-reared rats when 
the available stimuli for depth discrimi- 
nation were reduced, When motion par- 
allax was eliminated, leaving only the 

of optical texture 


cate that only motj 
innate cue for depth discrimination on 
the visual cliff. Since visually experi- 
enced animals show a Preference for the 
shallow side when cued only by textural 
density while dark-reared rats show no 
preference, the cue of textural density 
must be learned. This conclusion is rea- 
sonable, but it is not the only interpre- 
tation which can be given to these find- 
ings. An alternative stems from the 
observation that the procedures did not 
in fact isolate motion parallax from tex- 
tural density. Instead, the two gradients 
were opposed to each other. One gradi- 


on parallax is an 


ent was fixed to produce equidistance 
while the other was fixed to produce 
differential distance. In this conflict 
situation motion parallax was decisive. 
However, it cannot be concluded that 
the effectiveness of textural density re- 
quires learning. This conclusion can be 
derived only from the observation of per- 
formance in response to textural density 
in isolation. 

Regardless of the above qualification, 
the general trend of the evidence ob- 
tained in the visual cliff experiments is 
clear. The evidence led Gibson and 
Walk (1960) in an earlier article “to 
venture the rather broad conclusion that 
a seeing animal will be able to discrimi- 
nate depth when its locomotion is ade- 
quate, even when its locomotion begins 
at birth [p. 71].” 


Studies of Humans 


The case of the congenitally blind 
person whose vision is restored surgically 
would appear to provide an opportunity 
for gaining decisive evidence concerning 
the origin of visual perception. Since 
Molyneux’s famous inquiry addressed to 
Locke, philosophers and psychologists 
have looked to these cases for evidence. 
Several reviews have appeared which 
evaluate the results of the investigations 
of restored vision (Dennis, 1934; Sen- 
den, 1932; Wertheimer, 1951). Prior to 
evaluating the evidence it will be useful 
to consider several relevant methodologi- 
cal and definitional questions. 

1. The perception of space includes 
several related, but distinguishable and 
perhaps functionally independent, ex- 
periences. There is the fundamental and 
relatively undifferentiated experience of 
the extensity of the visual scene. The 
scene appears to extend in the third 
dimension and not merely to be a plane 
surface like a Painting in the frontal- 
parallel plane. Within the scene, objects 
have a specific location relative to us 
(absolute distance) and also a location 
relative to other objects in the scene 


imen 


ae 
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(relative distance). In addition, each 
object will appear to be some linear size, 
either absolutely or relative to other ob- 
jects. Finally, some objects will appear 
to be solid and others two-dimensional. 
Every student of perception knows that 
these aspects of the visual world are 
closely related. Nevertheless, it does not 
follow that an observer whose vision is 
tested immediately after surgical restora- 
tion and who is found deficient in one 
of these experiences is totally lacking in 
space perception. Therefore, broad con- 
clusions concerning space perception 
based on performances related to only 
one aspect of space perception should 
be considered cautiously. 

2. These remarks lead directly to a 
consideration of the method of testing. 
It is easier to specify prohibitions than 
to advance positive proposals. Tasks 
which demand prior visual learning for 
successful execution are unacceptable. 
One illustration of this is the designation 
of an object by the appropriate verbal 
label. Thus the subject might be con- 
fronted with a number of objects and the 
question “Which of the objects is a 
cube?” This question cannot be an- 
swered without a prior association of the 
auditory stimulus, “cube,” with the 


visual stimulus produced by a cube. Ob- 
or visual 


viously an observer with no pri 
experience will lack this association. A 
better procedure would be to confront 
the subject with equivalent plane and 
solid objects and then simply ask the 
subject to report whether he perceives 
several different objects OY several in- 
stances of the same object. i 
Similar reservations exist regarding 
another testing method which has often 
been used, Investigators have sought to 
determine whether apparent size is M- 
variant with changes in distance for the 
subject whose sight is newly restored. 
The presence or absence of constancy is 
used to derive conclusions concerning 
distance perception. These conclusions 


are open to question. Size judgment is a 
complex performance which is regulated 
by many variables in addition to distance 
perception. One powerful determinant 
is the conceptualization of the relation- 
ship between size and distance which the 
subject brings to bear on the experimen- 
tal task (cf. Carlson, 1960, 1962). Sub- 
jects with normal vision will produce 
results ranging from marked underesti- 
mation to marked overestimation of size 
depending on their attitude of observa- 
tion. In the absence of knowledge about 

the subject’s attitude of judgment, it is 

impossible to interpret the results of his 

performance. Therefore, prior informa- 

tion concerning the preoperative con- 

ceptualization of the visual world is a 

prerequisite for interpreting the perform- 

ance of the newly-seeing subject. 

The question with which this discus- 
sion was started remains. How should 
the subjects be tested? No ordered 
schedule of tests will be introduced here. 
However, it seems best that an open- 
ended question should initiate the in- 
quiry. It should not direct the subject 
to any specific aspect of the visual world. 
A simple question such as “What do you 
see?” would meet this requirement. The 
response to this question might introduce 
the experimenter to the subject’s pre- 
ferred language of perceptual experience. 
Later questions Or instructions could 
then be phrased in the subject’s lan- 
guage sO that there would be assurance 
of their meaningfulness to him. 

3, Senden (1932), Dennis (1934), 
and Wertheimer (1951) have described 
the postoperative visual disabilities 


which interfere with good vision. For 


instance many patients suffer from exag- 
gerated and involuntary nystagmus 
which may continue for several weeks. 
This interferes with the normal func- 
tioning of accommodation and converg- 
ence. Two additional postoperative ef- 
occur frequently are acute 


fects whic : $ 
dazzle and a narrowing of the visual 
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field. Obviously, tests which produce 
negative results are equivocal if the re- 
sults are obtained prior to the adjust- 
ment of the visual mechanisms. On the 
other hand, if the experimenter elects 
to postpone the test for several weeks to 
allow the inhibitory effects to diminish, 
then there is a possibility that some 


Spatially separated from the subject, The 
ertain in- 


i a he cannot 
estimate their absolute distance. Thus 


hich is only 
he does reach 
t localize the 
he eye. This 


1 foot from him, However, 
out into space and does no, 
object at the plane of t 
reaching response is qualitatively differ- 
ent from the normal groping that the 
blind usually execute in order to identify 
objects tactually. 
The evidence concerning relative dis- 
tance is ambiguous. The most frequently 
used procedure entailed a crude test of 
size constancy. Most examiners report 
that constancy is absent. This observa- 
tion suggests that differences in distance 
were not perceived. However, as was 
noted earlier, this conclusion can be 
challenged. 
Most of the cases reviewed by Senden 
(1932, pp. 264-271) indicate that the 
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discrimination of solidity is absent. For 
example one subject was unable to dis- 
tinguish a ping-pong ball from a compa- 
rable plane figure, for example, a white 
disc of equal diameter. For another sub- 
ject a table in the center of the room 
appeared flat against the farther wall. 

Recently London (1960) provided a 
summary of a Russian report on the 
postoperative newly-seeing which was 
published originally in 1953. This re- 
port is of special interest in that, unlike 
the Senden report, the writer, Pokrov- 
skii, performed the surgery and the post- 
operative visual tests. Of the six cases 
described only two are directly per- 
tinent. These are cases of two children, 
born with mild cataracts which allowed 
the patients the experience of light al- 
though pattern vision was not possible. 
London presents only the author’s state- 
ment of results omitting details of the 
testing procedure. In the only statement 
dealing with space perception Pokrovskii 
reported that 


the children were unable by vision alone to 
determine distance or, more exactly, the dis- 
tance of the nearest objects. When walking, 
they collided with these objects [London, 1960, 
p- 479]. 


This review of the investigation of the 
newly-seeing has revealed the unsatis- 
factory status of this evidence. The 
techniques of investigation are crude and 
the data are incomplete. In fact, there 
is not a single systematic parametric 
study of the visual perception of the 
newly-seeing. In the absence of more 
acceptable studies, the most prudent 
course is to refrain from introducing the 
findings described above as evidence. It 
ìs only fair to add that this judgment 
1s not shared by all who have considered 
these investigations. In his early theo- 
retical statements Hebb (1949) assigned 
great significance to Senden’s findings 
and he has more recently reaffirmed his 
evaluation (Hebb, 1963), 
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STUDIES or VISUALLY NAIVE OBSERVERS 


The study of the newborn organism 
requires that a reliable unlearned indica- 
tor response be available immediately 
after birth, This requirement can be sat- 
isfied for many organisms although not 
in all testing situations. For example, the 
human neonate cannot be tested with 
the visual cliff apparatus. In order to 
employ the visual cliff, the experimenter 
must await the development of locomo- 
tion at which time the organism is no 
longer perceptually naive. Nevertheless, 
the human infant is not entirely excluded 
from investigation. Fantz (1961) and 
Wertheimer (1961) have studied the 
human neonate by observing the syste- 
matic changes in orientation of the eyes 
in response to various stimuli. 


Studies of Animals 


Walk and Gibson’s (1961) work with 
the visual cliff apparatus did not include 
testing newborn observers. However, 
several species (€-g-, the chick and the 
goat) are able to locomote during the 
first day of life. Gibson and Walk re- 
ported that none of their 1-day-old 
chicks, goats, or lambs ever stepped onto 
the glass on the cliff side. Gibson and 
Walk consider it very likely that similar 
results would be obtained with newborn 
animals as well. This expectation Te- 
ceived support from Tallarico’s (1961) 
study of the choice behavior of 3-hour- 
old chicks on the visual cliff. Of the 320 
chicks observed, 90.6% chose the shal- 
low side. Those chicks that stepped onto 
the glass on the cliff side seemed to do 
so inadvertently. 

In another study Fishman and Tal- 
larico (1961) also studied visual depth 
perception in 3-hour-old chicks. As an 
indicator response Fishman and Tal- 
larico selected the avoidance response 
made on the first occasion of an ap- 
proaching object. Six groups of chicks 
were tested, Three groups consisted 
of chicks reared in the light. The re- 


maining three groups consisted of 
chicks reared in the dark. All chicks 
were tested individually at the age 
of 3 hours. There were three condi- 
tions of testing. In Condition FP 
(feigned poke) one experimenter held 
the chick in his fist while the second 
experimenter feigned a poke at the left 
eye of the chick. This consisted of mov- 
ing a black pocketcomb from a distance 
of 1 foot to within 4 inch from the sub- 
ject’s eye at the rate of 1 foot per second. 
The experimenter recorded the occur- 
rence of a spontaneous head movement 
away from the direction of the poke. In 
Condition EF (extended fan) no poke 
was feigned. Instead a black fan held 
about 1 foot from the subject was opened 
in about a second at a right angle to the 
subject’s line of sight. In Condition C 
(control) the experimenter held the 
chick as before and for the same length 
of time but without introducing any 
stimulus. 

The dark-reared chicks did not differ 
from the light-reared chicks under any 
of the three conditions. The control 
chicks made very few “ayoidance” head 
movements. Of the 73 control chicks 
only five spontaneous head movements 
were observed which were similar to the 
aversive movements of the experimental 
subjects. For both light-reared and dark- 
reared chicks the number of avoidance 
responses under Condition FP were sig- 
nificantly greater than under Condition 
EF. In fact only 1 of 18 chicks gave an 
avoidance response to EF, while 44 of 
96 chicks responded to FP. These re- 
sults are evidence that visually naive 
chicks can discriminate depth. The 


2 However, Riesen (1947, 1950) reported 


-reared chimpanzees «gid not blin! 
fat tien : ward the face [and. 


advanced slowly towar 


because it did, 
of the subject’s el 
tion his chimps suffered from 


oculomotor disturbance at the 
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avoidance reaction seems to be a re- 
sponse to the unique stimuli provided by 
an approaching object. It is nota startle 
response made to a novel stimulus since 
it did not occur under Condition EF. 
Nor does it occur with great frequency 
in the absence of stimulation (Condition 


It would be interesting to determine 
whether the newborn chick can indeed 
distinguish between the optical stimuli 
produced by a change in position and 
the comparable stimuli produced by an 
equivalent change in the size of a sta- 
tionary object. Apparently infant rhesus 
monkeys respond to the isolated presen- 
tation of a closed contour undergoing 
symmetrical expansion as if it were an 
approaching object (Schiff, Caviness, & 
Gibson, 1962). On the other hand the 
chicks in Fishman and Tallarico’s study 
did distinguish between the optical 
stimuli produced by FP and EF. How- 
ever, since these two conditions were not 
exactly comparable no firm conclusion 
can be drawn concerning our question. 
The apparatus and stimulus events em- 
ployed by Schiff et al. would be very 
appropriate for this purpose. 


Studies of Humans 


There have been even fewer studies of 
naive humans. Although the infants 
studied by Fantz (1961) had a median 
age of 15 weeks this investigation will 
be included. The only experiment which 
investigated newborn humans was re- 
ported by Wertheimer (1961), 

Fantz (1961) employed the stimulus- 
preference method using the direction 
of orientation of the eyes as an indicator 
of preference. A solid object and a com- 
parable flat object were exhibited for 
a series of 20-second trials and the sub- 

ject’s direction of gaze was recorded. 
The criterion of fixation was the super- 
position of the reflection of the object 
over the pupil. The subjects were 52 
infants with a median age of 15 weeks. 


The test objects were a sphere and a 
disc of equal diameter. Each infant was 
tested under eight conditions: binocu- 
lar and monocular vision, presence and 
absence of surface texture, direct and 
diffuse lighting. A differential response 
was obtained only in the tests using tex- 
tured objects under direct lighting. The 
other six conditions did not yield a dif- 
ferential response. A curious finding 
emerged from these results. Infants un- 
der 3 months did not give a differential 
response in the binocular test but the 
same subjects showed a clear preference 
in the monocular test. This result, to- 
gether with others, suggested to Fantz 
(1961) 

that the use of both eyes interferes with vision 
in the early months before the development of 
good binocular coordination, while binocular 
vision improves visual performance later on 
(pp. 29-30]. 

The results led Fantz to conclude that 
young infants discriminate solidity when 
gradients of texture and brightness are 
available. Binocular cues did not ap- 
pear to be essential in the presence of 
these gradients nor were binocular cues 
sufficient in the absence of differential 
gradients. It is interesting to note that 
whenever a differential response 0C- 
curred it was a preference for the solid 
object. It may be that the basis for this 
preference is the more general visual in- 
terest which young infants display in 
response to complex visual patterns 
(Berlyne, 1958; Fantz, 1958b). It is 
immediately evident that the presence of 
a general tendency of this sort intro- 
duces difficulties in interpreting Fantz’ 
(1961) results. It is not possible to 
know whether the differential responsive- 
ness to the sphere reflects a depth dis- 
crimination or a pattern preference. 

Wertheimer (1961) studied the psy- 
chomotor coordination of auditory and 
visual space of one human infant before 
she was 10 minutes old. The subject 
was born without anesthesia, by natur: 


childbirth. Testing commenced 3 min- 
utes after birth. On each trial a click 
was sounded next to the right or left ear 
of the subject who was lying on her back. 
Two experimenters recorded the infant’s 
eye movements in response to the click. 
Fifty-two successive trials were under- 
taken, On 45 of the 52 trials both ex- 
perimenters were in agreement concern- 
ing the eye movement. In 23 of these 
cases there was no movement. On the re- 
maining 22 trials 18 of the eye-move- 
- ment responses were in the direction of 
the click and four in the opposite direc- 
tion. Thus within 10 minutes after birth 
auditory localization and a coordinate 
primitive spatial localization is present 
in the human infant. 


EXPERIMENTAL VARIATION OF THE 
RELEVANT STIMULATION 


The only investigations in this cate- 
gory are those performed by Hess (1950, 
1956b, 1961). The earlier study was 
concerned with the development of depth 
responses to shading cues. Hess (1950, 
1961) reared two groups of 50 chicks 
under different conditions of illumina- 
tion. The control chicks were reared in 
cages illuminated from above. The ex- 
perimental chicks were reared in cages 
which were illuminated from below. 
Thus the two groups experienced differ- 
ent distributions of light and shade. 
They remained in these cages for a pe- 
tiod of 7 weeks after hatching. 

The test enclosure was a circular cage 
illuminated by lights at opposite sides 
of the enclosure. The test objects were 
two photographs placed side by side. 
The photographs showed grains of wheat 
in their natural coloring but about twice 
the normal size. One picture was right- 
side up and the other upside down. In 
the first experiment chicks were tested 
at the age of 7 weeks. They were put 
into the test cage in groups of two to 
four chicks, First they were permitted to 
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eat a few grains which were fastened 
vertically on a cardboard. Then the 
photographic plate was substituted and 
the chicks’ responses to the photographs 
were recorded for a 10-minute interval. 
The chicks were then returned to their 
rearing environments and retested a 
week later. At that time the illumina- 
tion in the test cage originated from the 
direction opposite to that of the light in 
the rearing cages. Because of the wide 
range in the number of responses made 
by individual chicks Hess analyzed the 

first pecks only. The results were the 

same for the test and retest. The chicks 

showed a statistically significant tend- 

ency to peck at the photograph repre- 

senting the distribution of shading con- 

sistent with their rearing. For instance, 

on the retest, 939 of the first pecks 

made by the control group were made 

on the photograph showing grain illumi- 

nated from above, while only 6.1% of 

the first pecks of the experimental groups 

were on that photograph. 

The second experiment studied the 
development of depth responses in the 
experimental animals. At 1-week inter- 
vals beginning at 1 week and continuing 
to 7 weeks of age, groups of chicks were 
taken from the rearing cage and tested. 
The main finding of the experiment was 
that the establishment of a significant 
preference for the photograph represent- 
ing overhead illumination occurred 
earlier in the control than the establish- 
ment of a significant preference for the 
opposite photograph in the experimental 
group. 

The results of the first experiment 
show that the chick’s responses to light 
and shade cues can be governed by the 
direction of lighting in its environment. 
This does not require the conclusion that 


no preference would be exhibited in the 
absence of experience. It may be that 
an unlearned response to illumination 


from above exists, but that this innate 
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preference can be extinguished if the 
environment does not support it. In 
fact, such may be the implication of the 
results of the second experiment. If no 
initial unlearned preference existed then 
the control and experimental groups 
should develop a preference with equal 
rapidity. On the other hand, if the ex- 
perimental group has an initial prefer- 
ence for illumination from above, then 
they must unlearn this preference prior 
to acquiring the new response. This 
could explain the finding that the ex- 
perimental group established a signifi- 
cant preference later than the control 
group. 

One incidental finding of this study 
which warrants notice is that the photo- 
graphs of grain were responded to as if 
they were real grains. Apparently, for 
chicks, pictorial representations yield 
depth responses in the absence of a his- 
tory of experience with representations. 
Although it has been amply demon- 
strated that human observers may have 
compelling experiences of depth when 
presented with photographs (e.g., Smith 
& Smith, 1961), it is not known whether 
experience with photographic represen- 
tations is a necessary condition. A re- 
cent study by Hochberg and Brooks 
(1962) found that a child “is capable of 
recognizing pictorial representations of 
solid objects without specific training or 
instruction [p. 628].” They infer, there- 
fore, “that there must be an unlearned 
propensity to respond to certain formal 
features of lines-on-paper [p. 628].” 
This does not answer our question; how- 
ever, it does lead to the expectation that 
similar results will be obtained for depth 
perception. 

In a later study Hess (1956b) ex- 
amined a more general question: “When 
an organism first perceives the environ- 
ment can it accurately see where things 
are [p. 74]?” Specifically, the experi- 
ment attempted to determine whether 
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the chick’s ability to localize and peck 
accurately at objects is innate or learned. 
Chicks were hatched in darkness and 
divided into two experimental groups 
and one control group. The experimen- 
tal chicks were fitted with hoods con- 
taining prisms which displaced the visual 
field either 7 degrees to the right or 
7 degrees to the left. The control chicks 
were fitted with hoods which did not 
contain image-displacing lenses. When 
they were 1 day old the chicks were 
tested for pecking accuracy. They were 
allowed to peck at a small brass nail 
embedded in modeling clay. The clay 
provided a visual record of the location 
of the chick’s pecks. 

The following is the rationale of this 
procedure. If spatial localization is 
learned then the early responses of the 
chick should be randomly distributed in 
the visual field. Only through practice 
should the pecks converge upon and 
finally strike the target. If this is a 


valid analysis of the development of | 


localization, then the displacing prisms 
should not affect the development of 
localization. In terms of this analysis 
the experimental and control groups 
both must learn to localize objects 
through a gradual process of trial and 
error pecking. On the other hand, if 
spatial localization is innate then the 
first pecks of the experimental chicks 
should cluster about a point 7 degrees 
to the right or left of the target while 
the pecks of the control chicks should 
center about the target. The results 
agreed with the latter:expectations. 
Following the test on the first day 
Hess divided each of the three groups in 
half. Half of each group was placed in 
a living enclosure in which grain was 
scattered loosely on the floor, ‘The other 
half of each group was placed in a box 
which contained bowls of mash. When 
the chicks were 3-4 days old, they were 
tested again. There was no difference 


ee  K.. 
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between the two halves of the control 
group. Both halves showed an increase 
in accuracy and both were in good 
physical condition. The two halves of 
the experimental group also improved 
their accuracy of pecking. However, 
their improvement was represented by a 
tighter cluster 7 degrees to the right or 
left of the target. The two halves of 
the experimental groups differed mark- 
edly in physical health. The group 
whose living cage contained bowls of 
mash was as healthy as the control 
group. The half whose living cage con- 
tained scattered grain only were in poor 
physical condition. Two of these chicks 
died on the next day. Hess concluded 
that spatial localization is innate for 
the chick and that the chick is unable 
to unlearn the innate localization of ob- 
jects even when survival is at stake. A 
similar absence of adaptation is observed 
in the persistent spatial disorientation 
which results from the more dramatic 
modifications of sensory input produced 
by surgical procedures (Sperry, 1951, 
1958). For example, surgical rotation 
of the eyes, producing an inverted visual 
field, leads to mislocalization without 


any subsequent adaptation. 
CoNCLUSION 


ecision about the genesis 


of space perception is possible. Nor 
does it seem likely that a conclusion of 
general applicability is forthcoming. In- 
stead the trend of the evidence strongly 
Suggests that separate statements will 
have to be made about each of the vari- 
ous stimulus correlates of depth per- 
ception. For example, there is evidence 
that motion parallax is an unlearned 
correlate of relative distance while dif- 
ferences in textural density are learned 
correlates (Walk & Gibson, 1961). This 
conclusion was clearly anticipated by 
Carr (1935) in his treatise on space 
Perception: 


No simple d 


There is no single phenomenon or problem 
of space. It is impossible to make any state- 
ment as to the native or empirical nature of 
space in general [p. 5; italics added]. 


This conclusion poses two questions 
whose resolution would provide further 
clarification of the issue. The first con- 
cerns the modifiability of innately de- 
termined perceptual preferences by later 
experience. It seems likely that the vari- 
ous stimulus correlates of depth percep- 
tion will vary in their susceptibility to 
the modifying influence of special ex- 
perience. The second question concerns 
the nature of the learning processes 
which underlie the formation of associa- 
tions between specific cues and the con- 
comitant visual experience of depth. 
This latter question is an extremely diffi- 
cult one as Pastore (1956) has pointed 
out in his critique of the empiristic view- 
point. Nevertheless, as evidence is ob- 
tained (e.g., Hess, 1950) that certain 
cues are learned, the need for an analysis 
of this question becomes compelling. 
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Studies from the Iowa laboratory and elsev 

i E where that hav 

comparison of the eyelid conditioning performance of bee ees 
extremes of the Taylor Manifest Anxiety (MA) scale are reviewed an: 
21 of 25 independent comparisons, differences between groups were in 


favor of the high anxiety (HA) Ss, 


with the majority being statistically 


° significant. Although these data provide substantial confirmation of the 


implication of the drive interpretation of MA scale that HA Ss should 


exhibit a higher level of performance than LA Ss, an attempt was made 
to ascertain what factors might be responsible for failure of the difference 
to occur in some studies. The major factors appeared to be small num- 
bers of Ss and the presence of “voluntary form” responders in the samples. 
Significant differences appear to be related to the degree of experimental 


naiveté of the Ss and the extent to which the experimental situation is 
designed to arouse some degree of apprehensiveness. 


Recently King, Kimble, Gorman, and 
King (1961) reported failure to find a 
significant relation between level of emo- 
tionality as measured by the Taylor 
Manifest Anxiety (MA) scale and per- 
formance in aversive (eyelid) condi- 
tioning. In the light of the previous evi- 
dence, this finding was rather surprising 
and the authors were able to conclude 
only that emotionality or anxiety must 
be an interacting variable that is re- 
lated to conditioning under certain as 
yet unknown conditions and not under 
others, The purpose of the present ar- 
ticle is (a) to review the data bearing 
on the relation between the MA scale 
and aversive conditioning, only a small 
fraction of which was cited by King et 
al., and (b) to examine this evidence in 
detail in an attempt to suggest what the 
factors are that might be responsible for 
the failure of the relationship to appear 
In some studies. 

In presenting these data, the results 


1 This study was carried out as part of a 
Project concerned with the influence of moti- 
Yation on performance in learning under Con- 
fate Nonr-1509(04), Project NR 154-107 be- 
ag een the University of Iowa and the Office 

Naval Research. 


of studies comparing performance in 
simple conditioning of Ss preselected on 
the basis of extreme MA scale scores will 
first be summarized. The second group 
of studies to be reviewed also involves 
comparison of the performance of pre- 
selected Ss but in differential condition- 
ing. Thirdly, the results of a number of 
previously unpublished comparisons 
based on simple conditioning data from 
studies conducted in the Iowa laboratory 
for other purposes will be presented, The 
Ss serving in these latter studies were 
unselected with respect to the MA scale. 
Since their scores on the test were avail- 
able, it was possible to determine the 
differences in the conditioning perform- 
ance between Ss who scored at the ex- 
tremes of the scale. 


STUDIES INVOLVING PRESELECTED 
EXTREME GROUPS 


Simple Conditioning. The findings of 


the previously published Towa studies 


and one unpublished study directly con- 
cerned with comparing the conditioning 
performance of preselected high and low 
anxiety (HA and LA) Ss are sum- 


marized in Table 1. The criteria 0 
129 
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TABLE 1 


Iowa STUDIES OF CONDITIONING PERFORMANCE OF PRESELECTED HA Anp LA Scaze Ss 


P No. Ready UCS No. Percent CR Difference 
Towa experiments trials signal (psi) S = HA IA (H-L) 

1. Taylor (1951) 80 Yes 1.6 60 59.6 27.9 21.7 .001 
2. Spence-Taylor (1951) 100 Yes 6 50 48.2 33.8 14.4 05" 
` 100 Yes 2.0 50 55.0 41.7 13.3 05 

3. Spence-Farber (1953) 60 Yes 1.0 64 48.8 34.1 14.7 05 
4. Spence-Beecroft (1954) 50 Yes 1.0 45 56.5 36.3 20.2 02 
5. Spence-Weyant (1960) 100 No 25 36 41.8 28.6 13.2 “10° 
100 No 2.0 36 65.4 53.2 12.2 10 

6. Spence (unpublished) 80 Yes .25 60 36.5 21.6 14.9 02° 
80 Yes 1.50 60 48.0 38.8 9.2 02 


a The F value based on 
b The F 


all groups provided a p value < .01. 
value based on all groups provided a p value < .05. 


© The F value based on all groups provided a p value < .01. 


selection of Ss in these studies varied 
slightly, ranging from raw scores of 20 
to 24 as the lower limit of the HA Ss 
and from 7 to 9 as the upper limit of 
the LA Ss. These scores roughly mark 
the upper and lower twentieth to 
twenty-fifth percentiles of the distribu- 
tion of MA scale scores made by students 
in the introductory course in psychology 
at Iowa. Shown in the successive col- 
umns of the table are: (a) the reference 
studies, (b) the number of trials over 
which the conditioning measures were 
obtained, (c) the use or nonuse of a 
ready signal, (d) the strength of the 
UCS (air puff) employed, (e) the total 
number of Ss in the two groups, (f) the 
mean percentage of CRs made by the 
HA group of Ss, (g) the mean percent- 
age of responses made by LA Ss, (k) 
the difference in percentage of CRs given 
by the two groups, and (7) the signifi- 
cance level of the differences between 
the groups expressed in terms of p (two- 
tailed test). 

As may be seen, the six Iowa studies 
provided nine independent comparisons 
of HA and LA groups at UCS intensi- 
ties ranging from very weak (.25 psi) 
to moderately strong ones (2.0 psi). In 
every study HA Ss responded with a 


higher percentage of CRs over the con- 
ditioning period than did the LA Ss and 
in each the difference was significant at 
the .05 level or better. Since the condi- 
tioning curves of the HA and LA groups 
tended to diverge, it is apparent that the 
differences in their performance at the 
end of training were even larger than 
those shown in Table 1. Thus in the case 
of these six studies the differences were 
approximately 20% larger over the last 
20 conditioning trials than for all trials. 
These larger differences also tended to 
be slightly more significant. Attention 
should also be directed to the fact that 
the numbers of Ss on which the signifi- 
cance values obtained in these studies 
were based are relatively large, ranging 
from a minimum of 45 in studies with 
two groups to a maximum of 120 when 
four groups were used. 

In turning to the non-Iowa studies in 
Table 2, it will be seen that not a single 
one of the comparisons involved as many 
as 50 Ss. The study (No. 3) with the 
largest number of Ss (36), it is worthy 
of note, did give results in line with those 
obtained in the Towa laboratory. In this 
study, conducted at Kent State Univer- 
sity, HA Ss responded at a significantly 
higher level (.01) than LA Ss. The 
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TABLE 2 
Non-Iowa STUDIES OF CONDITIONING PERFORMANCE OF HA anp LA Scare Ss 


No. Ready 


UCS No. Percent CR Difference 


Non-Iowa experiments 2 f — 

trials signal (psi) Ss HA LA (H-L) 5 
1. Hilgard et al. (1951) 60 Yes 1.6 5 3 
2. Prokasy-Truax (1959) 20 No 3.0 D ee re aa z 
3. Baron-Conner (1960) 80 No 1.6 36 44.9 27.8 “TE o1 
4. King et al. (1961) so Yes 15 3 aa o an S 
5. King et al. (1961) s% No 15 32 668 640 2 
6. King et al. (1961) so Yes 14 30 490 513 —23 Zs 


remaining comparisons in this portion of 
the table, however, failed to support 
such a finding, that is, there were no 
significant differences between groups. 
The first two of the studies probably 
should not be taken too seriously as 
they involved only 10 Ss in each ex- 
treme group. The variability of condi- 
tioning performance among individual 
Ss is so great that such results would 
not be unexpected with so few Ss. How- 
ever, the final three comparisons, all of 
which were reported by King, Kimble, 
and their students (1961) present a 
more formidable array of evidence that 
is quite contrary to the findings of the 
Iowa laboratory. Not only did they fail 
to obtain a significant difference in 
favor of the HA Ss, but two of the dif- 
ferences, though not significant, were 
actually in favor of the LA Ss. While 
only 30-32 Ss were involved in each ex- 
periment, the results as a whole are 
clearly opposed to the Iowa findings. 

In addition to the experiments con- 
tained in Tables 1 and 2, the findings 
of three further published studies that 
bear on the relation between the MA 
Scale and performance in simple condi- 
tioning should be mentioned. In each of 
these investigations unselected Ss were 
run in the experiment and comparisons 
Were subsequently made between Ss who 
Scored in the upper and lower /alves of 
= distribution of scores on the MA 
oe Two of these studies were pri- 

arily interested in the effects of ad- 


ministrating shocks on the level of con- 
ditioning performance. In the first, 
Spence, Farber, and Taylor (1954) 
found that HA Ss (V=15) responded at 
a significantly higher level in Trials 
1-40 than LA Ss (V=25) under the 
shock condition (p<.01). Under no 
shock HA Ss also gave more CRs than 
LA Ss, but with only 10 Ss per group the 
difference was not significant. In a par- 
tial replication of this experiment carried 
out at Peabody College, Caldwell and 
Cromwell (1959) compared Ss, 30 from 
the upper and 30 from the lower halves 
of the distribution of MA scale scores 
and obtained a significant relation (.05 
level) with conditioning performance 
over Trials 1-40. In each of these experi- 
ments, however, conditioning perform- 
ance over Trials 41-80 was not related 
either to shock-no shock or to anxiety 
level. The third and final study that 
compared above and below average Ss 
on the MA scale was conducted with 
students from the evening classes of 
Northwestern University (Spence & 
Taylor, 1953). Over the 80 conditioning 
trials, 22 HA Ss averaged 32.2% CRs, 
21 LA Ss, 18.2%. The difference was 
significant at the .04 level. 

Differential Conditioning. Table 3 pre- 
sents the data from two reported investi- 
gations of the relation between the MA 
scale and level of performance to the 
positive CS in differential conditioning. 
As in the studies of Table 1, HA and 
LA groups represent roughly the upper 
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TABLE 3 


STUDIES OF PERFORMANCE OF HA anp LA Scare Ss to Positive CS N 
DIFFERENTIAL CONDITIONING 


No. Ready No. Percent CR Difference 
Study trials signal Ss HA IA (H-L) 
1. Spence-Farber (1953) 31-50 Yes 36 50.0 39.7 10.3 05 
2. Spence-Farber (1954) 31-50 Yes 52 411 31.7 9.4 09 
3. Prokasy-Whaley (1962) 26-45 Yes 60 54.5 39.5 15.0 05 
4. Prokasy-Whaley (1962) 26-45 No 70 60.2 56.7 3.5 — 


and lower quartiles on the MA scale. 
Spence and Farber (1953, 1954) con- 
ducted two separate experiments differ- 
ing slightly in conditioning methodology. 
The data presented in the table show the 
level of response to the positive CS over 
the last 20 reinforced trials (31-50). 
As may be seen, in the first experiment, 
the HA group gave significantly more 
CRs than the LA group. The second 
experiment of Spence and Farber, as 
originally reported, included Ss on the 
basis of a forced-choice version of the 
scale or on the basis of the MA scale, 
The data presented here are based only 
on Ss who were identified as high or 
low on the MA scale. Again, as may 
be seen from the table, HA Ss responded 
at a higher level than LA Ss, The dif- 
ference was not significant at the .05 
level by a two-tailed test, but was on 
the basis of a one-tailed test, 

Using a factorial design, Prokasy and 
Whaley (1962) investigated whether the 


relation between performance on the MA 
scale and differential conditioning was a 
function of the presence or absence of a 
ready signal in the experimental pro- 
cedure. Table 3 presents the percentage 
of CRs given to the positive CS on the 
last 20 reinforced trials (26-45). As 
may be seen, when a ready signal was 
employed HA Ss gave a significantly 
larger number of CRs than did LA 
Ss. In the absence of a signal, however, 
the superiority of the HA group was 
small and not significant. 


UNPUBLISHED Data INVOLVING 
UNSELECTED SUBJECTS 


The third set of studies providing rele- 
vant evidence is presented in Table 4. 
Included in this group are the findings of 
two previously published studies (No. 1 
& 2) that were concerned with investi- 
gating among unselected Ss the relation 
between conditioning measures and 
physiological indices of emotionality 


TABLE 4 
Iowa STUDIES INVOLVING UNPUBLISHED DATA FROM POSTSELECTED Ss 


Study No. Ready UCS No. PercentCR Difference 
trials signal (psi) Ss HA LA (H-L) ? 
1, Runquist & Ross (1959) 41-80 + 10 27 400 ms à mai O 
2. Runquist & Spence (1959) 41-80 + 10 34 570 382 18.8 .05 
3, Individual differences: 1-80 + 1.0 52 41.8 45.5 —3.7 =< 
(unpubl.) 
4. Extinction: (unpubl.) 1-30 — 20 39 643 536 10.7 05 
Sa. Conditioning problem: 31-60 — 6 67 6B 476 142 0 
ý npubl.) 
Male (unp ja a 


5b. Conditioning problem: 
Female (unpubl.) 


6 31 71.0 59.0 12.0 me 


ae 
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(e.g., heart rate changes, GSR, and 
muscle action potential). As the scores 
on the MA scale were available it was 
possible to examine the conditioning per- 
formance of Ss who scored in the upper 
and lower quartiles of the distribution of 
scores on the scale. As the table shows, 
HA Ss gave a significantly greater num- 
ber of CRs than LA Ss in both of these 
studies. 

The remaining data presented in 
Table 4 come from three as yet unpub- 
lished studies from the Iowa laboratory. 
Each is identified by a word or label 
describing the primary interest of the 
study. In each of these investigations a 
fairly large number of Ss, the MA scale 
scores of whom were available after 
completion of the experiment, were con- 
ditioned under identical circumstances. 
Again, the findings reported are for Ss 
who scored in the upper and lower quar- 
tiles of the MA scale distribution. As 
may be seen, the three comparisons (No. 
4, 5a, & 5b) provided by the last two 
experiments once again show that HA Ss 
responded at a higher level than LA Ss. 
Two of these differences were significant, 
while the third (that involving the small- 
est number of Ss) was not. Attention 
should be directed to the fact that a 
ready signal was not employed in these 
latter studies. The relation of the use or 
nonuse of a ready signal to the findings 
of these studies will be discussed in a 
later section. 7 

The final item of note in Table 4 is 
the negative difference found in the 
study entitled “Individual Differences 
(No. 3). This is the only instance among 
a total of 17 comparisons of HA and LA 
Ss provided by our Iowa studies in 
which LA Ss gave a larger number of 
CRs than HA Ss. The small difference 
1S not, of course, significant. 


Discussion 


. Looking at the findings of the studies 
Mcluded in Tables 1, 2, 3, and 4 as a 


whole, a number of characteristi 

be noted. First, the N ke 
stances in which the conditioning per- 
— of the HA group was higher 
i an that of the LA group is much 
pae fion one would expect by chance. 

, the results of 21 of 25 comparisons 

were in this direction. If there actually 
were no relation between the MA scale 
and conditioning performance, the prob- 
ability of obtaining such a percentage of 
differences (84%) in the same direction 
by chance is less than .01. 

Secondly, it is clearly evident that the 
studies with relatively large numbers of 
Ss tended to provide significant differ- 
ences in favor of HA Ss, whereas those 
with smaller numbers did not. Thus, in 
the case of comparisons involving 36 or 
more Ss, 65% (11/17) of the compari- 
sons were significant at the .05 level or 
better on the basis of a two-tailed test. 
On the other hand, in studies involving 
fewer than 36 Ss only two of the eight 
comparisons provided a significant re- 
sult. Moreover, all of the 13 significant 
differences were in the direction of higher 
conditioning performance on the part of 
the HA Ss, whereas none of the four ob- 
tained differences in favor of the LA Ss 
were significant. 

As was mentioned earlier, intersubject 
variability in eyelid conditioning is ex- 
ceedingly great. In our experiments the 
percentages of CRs given by individuals 
customarily range all the way from zero 
to very high values. The standard devia- 
tions of these percentage measures vary 
roughly between 20 and 25%. Under 
such circumstances it is readily apparent 
that fairly large numbers of Ss must be 
sampled if an adequate test as to 
whether the manipulated variable pro- 
duces a difference is to be made. This is 
particularly the case if the size of the 

difference is not large relative to the 
variance of the measures. Apparently, 
degree of emotionality, as specified in 
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terms of extreme scores on the MA scale, 
is a relatively minor factor or variable 
among all those contributing to the in- 
tersubject variance of conditioning per- 
formance. In this connection, it is in- 
teresting to note that in two studies in 
which emotionality differences and dif- 
ferences in UCS (puff) strength were 
poth variables, the differences between 
the conditioning performances of the HA 
and LA Ss were of about the same order 
of magnitude as those obtained between 
relatively weak and strong puff intensi- 
ties (.25 vs. 1.5 psi and .6 vs. 2.0 psi). 
In the case of both types of comparison, 
individual performances ranged from 
very low to high levels in the groups 
being compared and the overlap was con- 
siderable, 

In the light of these considerations, it 
is not at all surprising to find such di- 
vergent findings among studies that em- 
ploy such small numbers of Ss, nor is it 
difficult to understand why the Iowa 
studies, with their relatively larger sam- 
ples, have tended to be more consistent 
and to give a higher proportion of sig- 
oo differences in favor of the HA 

s. 

A third characteristic of the data pro- 
vided by these studies is that the rela- 
tion between the MA scale and condi- 
tioning does appear to vary from one 
laboratory to another. This fact is 
clearly revealed in Table 5, which pre- 
sents an analysis of the data in terms of 
their source. As may be seen, we have 
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divided the studies into three groups, 
those carried out in the Iowa laboratory 
and those from other laboratories. The 
latter, in turn, have been broken down 
into two subgroups, primarily on the 
basis of the strikingly different findings 
obtained in them. 

It is clearly evident from the data of 
Table 5 that the Iowa studies have most 
consistently demonstrated a relation be- 
tween conditioning performance and 
emotionality (MA scale). Thus 16 
(94%) of the 17 comparisons gave dif- 
ferences in favor of the HA Ss, with 10 
(60%) of these differences being signifi- 
cant at the .05 level or better. Contrast- 
ing most sharply with these findings are 
those obtained in the subgroup of non- 
Iowa studies from Duke and North 
Carolina. These studies, carried out or 
supervised by Kimble and his associates, 
failed to find a significant difference in 
three comparisons of the HA and LA Ss 
and, moreover, found that LA Ss re- 
sponded at a higher level than HA Ss in 
two of their three experiments. Falling 
in between, but closer to the results of 
the Iowa studies, are the findings from 
the four other non-Iowa laboratories. 
Thus, in four of five comparisons, HA 
Ss gave more CRs than LA groups, with 
two of these being significant. Recalling 
that two of the studies in this latter 
group had only 10 Ss per group, we see 
that two of the three experiments that 
had groups of 36 or more obtained sig- 
nificant differences, 


TABLE 5 


COMPARISON OF DIFFERENCES BETWEEN HA anp LA Ss’ CONDITIONING 
PERFORMANCE OBTAINED IN DIFFERENT LABORATORIES 


No. Mean Percent Pi : 
t : ercent 
Laboratory comparisons No. Ss HA > LA significant (.05) 
Iowa 17 47.0 94 60 
Non-Iowa I* 5 41.2 80 30 
Non-Iowa II” 3 31.3 33 5 


a Obtained in the university laboratories of Stanford, Wisconsin, Pennsylvania State, and Kent State. 


b Obtained in the universi 


ty laboratories of Duke and North Carolina. 
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In summary, we see that when studies 
with a reasonably large number of Ss 
(36 or more) are considered, the experi- 
ments conducted in the Iowa laboratory 
and those from laboratories other 
than Duke and North Carolina tend to 
be in close agreement with each other in 
pointing to a genuine difference in the 
conditioning performance of the HA and 
LA Ss. The studies of Kimble and his 
associates, on the other hand, do not. 

In an effort to ascertain what factors 
might be responsible for these contrast- 
ing findings a comparison was made of 
the experimental conditions under which 
the studies were conducted. For this 
purpose a more detailed description of 
the experimental situation, procedures, 
and experiences of the Ss was obtained 
through correspondence with the investi- 
gators involved. 

With respect to the conditions under 
which the experiments were conducted 
at Iowa, we were greatly influenced by 
the writer’s theoretical interpretation as 
to why HA Ss could be expected to 
perform at a higher level than LA Ss. 
According to this theory, the differences 
in conditioning performance of these 
two groups of Ss reflect differences in 
their level of generalized drive (D), 
which, in turn, are assumed to be the 
result of differences in their level of 
emotional reactivity to the experimental 
situation and procedures. Accordingly, 
a deliberate attempt was made in the 
Iowa studies to provide conditions in the 
laboratory that might elicit some degree 
of emotionality. Thus, the experimenter 
was instructed to be impersonal and 
quite formal in greeting S and in giving 
the necessary instructions. On coming 
into the experiment, S at first saw an 
Impressive array of electronic recording 
equipment and was then led into an ad- 
Joining room in which was located an 
isolated, screened cubicle. The latter 
contained a dental chair (sic) in which 


S was seated in a reclining position, 
a x radband was placed on his 
t A a plastic piece was fastened 
the beak a ea atien a mpleting 

he ions, the illumination in the 
ee E = reduced to a low level of 
co pte ae Sar trite 

f arose, he could get in touch 
with E by means of a microphone placed 
e S stand within his reach. The door 

e cubicle and the door leading to 

the adjoining room in which Æ worked 
were then closed and S was left in isola- 
tion. 

To say the least, these conditions were 
unusual and strange for Ss. Further- 
more, in order to maximize the likelihood 
that they would have a tendency to 
arouse some degree of apprehensiveness, 
only individuals who had no previous 
experience as an S in psychological labo- 
ratory experiments were used in all but 
one of our studies in Tables 1, 2, and 3. 
It has been our experience that students 
are much more likely to be concerned 
and apprehensive in the first experiment 
in which they serve. After one or two 
experiences they become much less fear- 
ful and, all too often, quite bored and 
blasé. Some experimental evidence, that 
the amount of such previous experience 
is a factor in experiments comparing the 
performances of the HA and LA Ss, has 
been provided by an experiment reported 
by Mednick (1957). The latter found 
that experimentally naive HA Ss showed 
more stimulus generalization of a re- 
sponse than LA Ss, whereas nO such 
difference was obtained in the case of Ss 
who had served in from two to three 
previous psychological experiments. 

It is evident from this account that 
the conditions under which our experi- 
ments were conducted were designed 
to arrange for some degree of ag 

d anxiety oF emotionality. Whi e 
poues y hypothesized 
one version 0 


f our theor. p 
that differences in emotionality of the 
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HA and LA Ss might be chronic, an 
alternative possibility was that it was 
dependent upon some degree of stress 
being present in the situation. We were 
primarily interested in testing the theory 
under conditions that maximized the 
likelihood of differences in the emotional 
reactions of the two groups of Ss. 

In the light of the above discussion, 
it is interesting to compare the condi- 
tions and procedures employed by 
Kimble and King with those employed 
at Iowa. First, with respect to physical 
conditions, there are a number of dif- 
ferences. The two Duke experiments 
were conducted in a cubicle within a 
room in which Æ was located. However, 
instead of being semidark, the cubicle 
was rather brightly lighted (23 apparent 
foot-candles). The experiment at the 
University of North Carolina was also 
conducted under bright illumination with 
S and E being in the same room on the 
opposite sides of a partition. In place 
of a dental chair, Ss in the Duke ex- 
periments were seated on a secretarial- 
type chair with chin in a headrest. At 
North Carolina, S sat in a straight desk 
chair with head free. In addition to 
these physical differences, most Ss in the 
Kimble-King experiments had already 
been in one or two experiments prior to 
serving in the conditioning experiment. 

Considering the differences in the two 
laboratories in terms of the use of dif- 
ferent degrees of isolation of S from E, 
semidarkness versus well-lighted room, 
dental chair (emotionally conditioned 
cue) versus neutral-type chair, and ex- 
perimentally naive versus sophisticated 

Ss, it would seem quite plausible to sus- 
pect that the differences in the findings 
could, in part at least, be due to the fact 
that the experimental conditions at Duke 
and North Carolina were not as emotion 
arousing as those employed in the Towa 
experiments. Further experimentation is 
needed, of course, to check on the role 
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of such factors for there are other dif- 
ferences between the two sets of experi- 
ments that might have played important 
roles. One important one that should 
be investigated is the possibility that 
cultural backgrounds of southern and 
northern students may lead to a dif- 
ference in the manner in which they 
respond to the different items in the MA 
scale. 

Still another variable is the Z-S inter- 
action. Our Es were not acquainted 
with Ss and we have attempted to instill 
in them an attitude of being quite im- 
personal at the beginning of the experi- 
ment and to avoid making any particular 
attempt to put S at ease or allay any 
expressed fears. Undoubtedly Es differ 
greatly in the degree to which they are 
able to achieve and maintain this objec- 
tive. It is interesting to note that the 
person who served as one of two Es in 
the single Iowa study that gave negative 
results (LA > HA) has recently com- 
pleted a doctoral dissertation in which 
there again was little difference between 
HA and LA Ss. Unfortunately, there 
were only a small number of the HA and 
LA Ss under each of the different con- 
ditions of this experiment and the pro~- 
portion of male and female Ss was not 
equated so the results are not very help- 
ful. This is, nevertheless, a potentially 
important variable and should be inves- 
tigated further, possibly by deliberately 
manipulating the behavior of Æ. It has 
recently been shown experimentally that 
one can markedly increase the condition- 
ing performance of Ss by emotion pro- 
ducing instructions (Spence & Goldstein, 
1961). 

A final variable to be considered is 
the presence or absence of a ready signal. 
In a recent article Prokasy and Whaley 
(1962) have proposed that the use or 
nonuse of a “ready-blink” signal is a 
possible factor with which emotionality 
might be interacting in these condition- 
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ing studies, In support of this notion 
these investigators found, as is shown 
in Table 2, that in differential condition- 
ing, their HA Ss gave a significantly 
larger number of CRs to the positive 
CS than LA Ss when a ready-blink signal 
was used, whereas only a small, insignifi- 
cant difference was obtained in its 
absence. However, before this study 
even appeared there was evidence against 
the notion that the difference in condi- 
tioning performance of the HA and LA 
Ss was a function of the presence or 
absence of a ready signal. As Table 1 
reveals, Baron and Conner (1960) had 
reported a highly significant difference 
(p < .01), while Spence and Weyant 
(1960) also obtained a significant F 
(< .05 level) for HA and LA Ss in ex- 
periments that did not employ a ready 
signal. Further evidence against this 
view that a ready signal is a necessary 
condition for the difference to occur is 
provided by the findings of recent experi- 
ments from our laboratory presented in 
Table 4. As the last three items in this 
table show, three comparisons between 
HA and LA Ss in which there was no 
ready signal all gave differences in favor 
of the HA group with the two com- 
parisons that involved more than 35 Ss 
being significant at the .05 level. 
Unfortunately, a serious methodologi- 
cal error is present in the early studies 
that did not employ a ready signal. As 
Hartman and Ross (1961) have re- 
cently demonstrated the latency criterion 
employed in the Iowa laboratory 
(Spence & Ross, 1959; Spence & Taylor, 
1951) to identify and eliminate “volun- 
tary” responders from the data sample 
is not applicable when a ready signal is 
not employed. The use of this criterion 
was originally based on the finding in 
experiments using & ready signal and a 
CS-UCS interval of 500 milliseconds 
that the latency distribution of responses 
judged to be similar in form to voluntary 


eyelid closures did not overlap to any 
great extent with that for CRs, which 
are very different in form. Thus, it was 
possible to employ this more convenient 
latency property as the index of whether 
a response was a voluntary response or 
a CR. As employed in these studies 
all responses with latencies between 300 
and 500 milliseconds were counted as 
CRs, while responses whose latencies 
fell between 150 milliseconds (tone CS) 
or 200 milliseconds (light CS) and 300 
milliseconds were counted as voluntary 
responses. The Ss who gave 50% or 
more responses in the voluntary category 
were eliminated from the sample data. 
The reason for so doing was that the 
behavior of such Ss appeared to obey 
quite different laws from those holding 
for Ss who gave few such voluntary re- 
sponses. Among the differences impor- 
tantly related to our present concern are 
(a) the tendency to respond at an ex- 
tremely high level (approaching 100%) 
once the first such anticipatory volun- 
tary response is made and (b) the lack 
of any relation whatever between the 
frequency of responses given by such 
Ss and the level of the MA scale score. 
The effect of both of these tendencies is 
clearly that of reducing the likelihood 
of obtaining a significant difference in 
the conditioning performance of the HA 
and LA Ss, the first by greatly increas- 
ing the variance of the conditioning 
scores and the second by reducing the 
difference between groups that contained 
a number of such Ss. ’ 
Unfortunately, investigators who did 
not use a ready signal took over the 
latency criterion to identify voluntary 
responders without checking its TE 
priateness, with the consequence that Fa 
such Ss probably were not aliniate 
from their samples. The Prokasy an 
Fore (1959) study is especially mip 
in this regard as it involved avery a 
puff (3.0 psi), 2 condition that produces 
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a high incidence of such voluntary re- 
sponders. When one considers the small 
number of Ss (10 per group) in this ex- 
periment, the negative difference ob- 
tained is not too surprising. The 
presence of one or two more high re- 
sponding voluntary Ss in the LA group 
than in the HA group could easily have 
produced this result. The findings ob- 
tained with the no-signal groups in the 
later study of Prokasy and Whaley 
(1962) and one of the Duke studies also 
may be suspected of being affected by 
the presence of a number of voluntary 
responders who were not detected by the 
inappropriate latency criterion that was 
employed. 
An example of the effects of including 
the data of voluntary responders in a 
sample is provided by the recent studies 
(Table 4) from the Iowa laboratory 
that did not employ a ready signal. 
Having confirmed the findings of Hart- 
man and Ross (1961) that latency does 
not differentiate voluntary form re- 
sponses from CRs when a ready signal 
is absent, the identification of voluntary 
Ss in these studies was made in terms 
of the form of the eyelid responses. If, 
instead, the latency criterion had been 
employed none of the voluntary respond- 
ers detected by the form criteria would 
have been identified and thus would 
have been included in the samples. It 
is worthy of note that the inclusion of 
six voluntary responders (three in each 
group) in the case of the comparison 
involving the male Ss of our most recent 
experiment (Table 4, No. 5a) would 
have reduced the reported difference of 
14.2% between HA and LA Ss to 11.0%. 
Correspondingly, the significance of the 
difference would have been reduced from 
the .05 level to one of .16. In other 
words, the addition of only six volun- 
tary Ss to the sample of 67 would have 
changed the conclusion from one that the 
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difference is significant to one that it is 
not. 

In concluding this discussion, atten-- 
tion should especially be directed to the 
point that the behavior of human Ss 
in classical eyelid conditioning, while 
simple in form, is complexly determined. 
Recent research in this and other labo- 
ratories has revealed that higher mental 


processes (inhibitory and facilitatory + 


sets, etc.) play a much more prominent 
role in human conditioning than has 
sometimes been realized. Moreover, our 
lack of knowledge concerning these fac- 
tors has precluded our controlling them 
in any satisfactory manner with the 
result that the intersubject variance of 
conditioning performance is extremely 
large. This state of affairs has unfortu- 
nate consequences for the testing of 
simple theories of behavior which specify 
only associational and motivational con- 
structs (e.g., H and D) as determiners 
of excitatory strength of the conditioned 
responses. Obviously, the additional un- 
controlled variables minimize the role 
of these basic theoretical factors, tend- 
ing to hide their effects. Under such cir- 
cumstances it is necessary to have 
reasonably large samples so that the 
effects of these confounding variables 
are more likely to be equalized in the 
comparison groups. On the basis of the 
writer’s experience with conditioning 
data, experiments of the type discussed 
here should be required to have groups 
of at least 25 Ss. Unhappily, as we have 


seen, this has all too frequently not been 
the case. 
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PSYCHOPHYSICAL CORRELATES OF 
INTRATYMPANIC REFLEX ACTION! 


MICHEL LOEB 
United States Army Medical Research Laboratory, Fort Knox, Kentucky 


A review is presented of research on the intratympanic muscle reflexes. 
Among topics covered are a discussion of the methods employed for 
observation of reflex action, discussion of the natural Protection against 
traumatic noise afforded by the reflexes and its limitation, and discussion 


of various types of stimulation (especially acoustic and cutaneous) which 
may be used to elicit the reflexes. Also described are possible Practical 
uses of artificial reflex elicitation to provide enhanced protection against 
noise and for certain kinds of clinical diagnosis. Implications for damage 
risk criteria and for certain areas of psychological and physiological 


research are discussed. 


In the past several years there has 
been a considerable resurgence of in- 
terest in the mechanism of the intra- 
tympanic acoustic reflex (AR). As Ward 
(1961) and others have pointed out, 
this is true not only because of the po- 
tential practical value which artificial 
elicitation of the reflex might have in 
protecting the hearing of individuals and 
in clinical diagnosis but also because of 
the realization of other possible implica- 
tions. Recent research indicates that the 
intratympanic muscle action should pos- 
sibly be taken into account in explana- 
tions of masking, remote masking, the 
scaling of loudness, mechanisms of atten- 
tion, and intersensory effects, 


NATURE oF THE REFLEX 


The intratympanic muscle reflex is a 
reflexive response on the part of two 
small muscles of the middle ear. One, 
the tensor tympani, runs from the carti- 
Jaginous portion of the Eustachian tube 
to the manubrium or handle of the 
malleus (hammer); the other, smaller 
muscle, the stapedius, lies within the 

i is a modified version of a 
Bins ga the Seminar on Middle Ear 
Function at Fort Knox, Kentucky, 7 May 
1962. I am especially indebted to J. L; 
Fletcher and to other Es in iie 
Seminar for suggestions regarding the materia! 


included. 
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posterior wall of the middle ear and its 
tendon inserts on the head of the stapes 
(stirrup). 

The tensor tympani is innervated by 
a branch of the trigeminal nerve and 
may receive sympathetic innervation 
(Byrne, 1938). It has also been re- 
ported recently that the tensor is in- 
nervated by a branch of the glosso- 
pharyngeal nerve (Lawrence, 1962). 
The stapedius is innervated by a branch 
of the facial nerve. Seemingly the mus- 
cles receive little or no sensory innerva- 
tion (Blevins, 1963). In response to 
activating stimuli—usually loud sounds 
—the stapedius and tensor tympani pull 
in such a manner as to damp, limit, or 
reduce the extent of the ossicular vibra- 
tions and correspondingly damp, limit, 
or reduce the acoustic input to the co- 
chlea. The direction of pull of the mus- 
cles is at right angles to the axes of 
their respective ossicles in such a fashion 
that the direction of action of one is 
almost directly opposed to that of the 
other (Wever & Lawrence, 1954). It 
has long been known that the reflex is 
bilateral. Acoustic stimulation of either 
ear elicits the reflex in both ears (Ham- 
merschlag, 1899; Pollak, 1886). While 
it is apparent that the reaction to acous- 
tic stimulation is bilateral, there is some 
question as to whether the homolateral 
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and contralateral responses are equal in 
magnitude. Detailed descriptions of ten- 
sor tympani and stapedius anatomy and 
physiology are available elsewhere 
(Hilding & Fletcher, 1960; Kobrak, 
1959; Perlman, 1960; Wever & Law- 


rence, 1954). 
Tt has been suggested that the noise 


attenuating properties of the reflex are 
primarily attributable to action of the 
stapedius muscle (Galambos & Rupert, 
1959; Simmons, 1959). As some observ- 
ers have emphasized, the action of the 
muscle is somewhat analogous to that of 
the iris and blink reflexes of the eye. 
However, because of the sharp rise time 
and brief duration of certain acoustic 
stimuli, in comparison to the latency of 
the reflex, and the relatively long dura- 
tion of other acoustic stimuli, in com- 
parison with the fatigue time of the 
reflex, the protection offered the ear by 
the intratympanic reflex is not as ade- 
quate as that afforded the eye by the iris 


reflex. 


MEASUREMENT OF MUSCLE ACTION 


A remarkable diversity of techniques 
has been employed to measure the action 
of the intratympanic muscles. Large 
contractions may be seen with the naked 
eye or with a microscope, and somewhat 
smaller ones may be viewed by observ- 
ing objects inserted into the muscle ten- 
don. Hensen (1878) made direct ob- 
servation of the tensor tympani and 
demonstrated that the reflex was elicited 
by acoustic stimulation. With the stape- 
dius, an appreciably smaller muscle, 
direct observation is more difficult. Kato 
(1913) made myograms of stapedial 
movements by tying @ thread to the 
muscle and connecting it to 4 stylus on 
a rotating drum. Today myograms are 
usually made by monitoring the output 
of mechano-electronic transducers fixed 
in the muscle tendons (Wersäll, 1958). 

The movements of the middle eat 
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pone especially those of the tensor 
sige result in movements of the ear- 
rum. Stapedius contractions produce 
an eardrum extrusion; tensor tympani 
been directi a The:dram nas 
1899; Kato, 1913; Laeger 
Waar, 1923), but sm iarten nar 
E , but small movements are 
quite difficult to detect. Köhler (1909) 
— a small mirror to the drum and 
anes changes in the reflected light. 
it is possible to detect smaller 
movements with this technique, there is 
probably some loading of the membrane. 
Other related methods include measure- 
ment of changes in volume or pressure 
produced by eardrum movement (Mend- 
elson, 1961; Terkildsen, 1960; Tsuka- 
moto, 1934) and changes in acoustic 
impedance of the ear (Jepsen, 1951; 
Klockhoff, 1961; Metz, 1951). The 
acoustic impedance of the ear may be 
defined as resistance to motion on the 
part of aural structure when acoustic 
energy is applied. It is a function of 
the stiffness, mass, and resistance (fric- 
tion) of the structures and may be esti- 
mated by certain pressure Or acoustic 
measurements at the ear canal (cf. 
Zwislocki, 1961). Manometric, volu- 
metric, and impedance change tech- 
niques are objective (although some 
experimenter judgment is required re- 
garding presence or absence and magni- 
tude of response) and fairly sensitive. It 
is possible that the impedance change 
method is more sensitive to tensor tym- 
pani than to stapedius action (Reger, 
1960), and it is not always possible to 
determine both tensor and stapedius 
response on the same manometric re- 
cording (Mendelson, 1961). While the 
latencies of the muscle reflex vary, Over- 
lap is possible, and the changes pro- 
duced by the two may cancel. Moreover, 
neither the manometric nor impedance 
re gives a direct indica- 


change procedu: $ 
tion of actual protection afforded by the 
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reflex—the reduction of sound energy 
impinging on the oval window attributa- 
ble to action of the acoustic reflex. 


MEASURES OF ATTENUATION: REDUC- 
TION OF COCHLEAR MICROPHONIC AND 
CHANGES IN PsyCHOPHYSICAL INDICES 


One index of amount of protection 
afforded by the acoustic reflex is the re- 
duction in cochlear microphonics pro- 
duced by reflex activation (Galambos & 
Rupert, 1959; Simmons, 1959; Wever 
& Bray, 1942; Wiggers, 1937). A reduc- 
tion in the ipsilateral or contralateral 
response to a loud tone may be noted 
after introduction of an activating 
stimulus. Estimates of maximum at- 
tenuation range from 10 to 20 decibels 
(Galambos & Rupert, 1959) to 40 deci- 
bels (Wiggers, 1937). It is reported 
that protection is greatest for tones be- 
low 2,000 cycles per second (Simmons, 
1959), 

While the cochlear microphonic tech- 
nique has yielded the most data regard- 
ing amount of protection afforded by 
reflex action, it has certain limitations. 
Cochlear potentials are not readily re- 
corded from humans, although in the 
course of certain surgical procedures 
they have been recorded (Ruben, Bord- 
ley, & Lieberman, 1961). A more serious 
objection is that, at or above a mod- 
erately intense level, the cochlear micro- 
phonic no longer responds linearly to 
sound (Davis, 1960) and may even be 
distorted (Lawrence, Wolsk, & Schmidt, 
1962). 

Alternatively, attenuation may be 
estimated by changes in several psy- 
chophysical indices—notably absolute 
threshold, loudness, and temporary 

threshold shift. Békésy and Rosenblith 
(1951) are perhaps the earliest to re- 
port the use of psychophysical determi- 
nation to measure attenuation afforded 
by the acoustic reflex. They introduced 
a loud pure tone into one ear and ob- 
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served the attendant change of loudness 
in a tone presented to the contralateral 
ear. The changes observed indicated 
attenuation values of 5-10 decibels. 
Shapley (1954) introduced a loud ther- 
mal noise in one ear and noted changes 
in the loudness of a tone in the con- 
tralateral ear. (Loudness reduction was 
measured by requiring a subject to 
match the intensity of a tone presented 
during the contralateral stimulation to 
the intensity of a tone in the absence of 
such stimulation.) Attenuation values 
ranging from 6 to 15 decibels were ob- 
tained, There was some possibility, how- 
ever, of direct masking due to bone con- 
duction of the thermal noise from one 
side of the head to the other, Loeb and 
Riopelle (1960) employed a method in 
which observers were asked to track their 
threshold at 125, 500, or 1,000 cycles per 
second by a Békésy technique with and 
without a loud 2,000 cycles per second 
tone introduced into the contralateral 
ear. The difference in thresholds in the 
presence and absence of the contralat- 
eral 2,000 cycles per second tone was 
assumed to be an index of reflex attenua- 
tion. With test and activating tones at 
these frequencies, the attenuation 
through the skull is such that masking 
should not be a factor. Threshold shifts 
were quite small—approximately 3 deci- 
bels. Threshold shifts obtained by the 
conventional method of limits were of 
the same order of magnitude. In a sec- 
ond part of the same investigation a 
loudness balancing technique similar to 
that employed by Shapley (1954) was 
used, but a 2,200 cycles per second tone, 
rather than a thermal noise, was em- 
ployed to activate the reflex. The data 
indicated that the loudness of intense 
tones was reduced to a greater degree 
than that of tones of lesser physical in- 
tensity. It was suggested, on the basis 
of these experiments and the discrepancy 
between the threshold shift data and 
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earlier experiments employing cochlear 
microphonics, that the reflex might act as 
a limiter rather than an attenuator, so 
that intense sounds may be attenuated 
considerably and faint sounds relatively 
little. Wever (1962) states that this 
hypothesis is contrary to other observa- 
tions (not enumerated) and suggests in- 
stead a sort of constancy mechanism— 
a central process by which action of the 
reflex is taken into account. It is diffi- 
cult to see, however, how this latter 
explanation could account for the small 
threshold shifts obtained. It should be 


pointed out that the results of loudness 
balance experiments by Reger (1960) 
do not support the hypothesis of dif- 
ferential attenuation of faint and in- 
tense sounds, while data from someni 
similar experiments by Prather (1961) 
tend to corroborate the Loeb and Rio- 
pelle (1960) findings. 

An alternate explanation advanced 
(Loeb & Riopelle, 1960) was that sub- 
jects may be unconsciously—or Con- 
sciously—inhibiting the reflex during 
the psychophysical task. Smith (1943) 
and Reger (1960) have described indi- 
viduals who have voluntary control of 
the acoustic reflex. According to Reger 


1960), only 1 or 2% of the population 
d, but it is possible that a 


is so talente sible that ¢ 
much larger percentage may inhibit their 


reflexes unconsciously when it is advan- 
tageous to do so. In any event, it has 
been demonstrated that if loud high fre- 
quency thermal noise is introduced into 
one ear and the resultant contralateral 
threshold shift is measured, larger shifts 
(as much as 15 decibels) are obtained 
than when the activating sound is a pure 
tone (Loeb & Fletcher, 1961; Ward, 
1961). Fatigability of the reflex is also 
seemingly less when it is activated and 
maintained with a noise stimulus than 
when a tone is employed. Clicks are at 
least as effective in arousing and main- 
taining the reflex as noise (Fletcher & 


Loeb, 1962). Moreover, the difference 
is attributable to the changing quality 
of noise and click stimuli rather than 
their complexity, for square waves are 
no more effective than pure tones in pro- 
ducing contralateral threshold shift 
(Fletcher & Loeb, 1962). The fact that 
changing stimuli are more effective in 
producing contralateral threshold shift 
than steady ones might be interpreted 
as evidence supporting the notion that 
subjects may be somehow inhibiting re- 
flex action during psychophysical meas- 
urements. Presumably with ever-chang- 
ing activating stimulus, this should be 
harder to do. A more parsimonious ex- 
planation might be that reflex adapta- 
tion is very rapid for pure tone stimuli 
and less so for changing ones. Wersäll 

(1958) stated that the intratympanic 
muscle reflexes decline in strength over 
time, but that presentation of a tone of 

different frequency brings the contrac- 

tion back to full magnitude. Simmons 

(1959) noted that a frequency modu- 

lated tone produces a more sustained 

reduction of cochlear microphonics in 

the contralateral ear than a steady one. 

Whether adaptation of the reflex is suffi- 

ciently rapid to account for the observed 

differences in contralateral threshold 

shift is debatable. 

Tt has been suggested (Ward, 1961) 
that the contralateral threshold shift 
technique may in part measure “central 
masking”—a change in sensitivity due 
to pre-empting of neural pathways com- 
mon to the two ears. An auditory cen- 


tral masking effect has been demon- 
strated by Ingham (1959) at levels 


below those usually considered neces- 


sary for activation of the middle ear 


muscle reflexes. His results, however, 
indicate that central masking is to be 
expected when the stimulus in the mask- 
ing ear has its primary energy in the 
frequency region identical or adjacent 


to that of the test stimulus in the contra- 
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ear. Extrapolation from his Te- 
nage a that no central masking 
should be expected with the frequencies 
and intensities of activating and test 
tones employed in experiments on the 
acoustic reflexes cited above. However, 
it is possible that at high intensities 
some other form of central masking may 
be present. Studies of contralateral 
threshold shift on patients lacking the 
middle ear acoustic reflexes (e.g., pa- 
tients with Bell’s palsy or stapedectom- 
ized patients) may serve to evaluate 
Ward’s (1961) hypothesis. 

Another method of estimating reflex 
attenuation is the temporary threshold 
shift reduction (TTSR) technique 
(Fletcher & Riopelle, 1960; Fletcher, 
196la, 1961b; Ward, Selters, & Glorig, 
1961). In this technique a train of loud 
impulsive noises sufficiently intense to 
produce a reversible loss of auditory 
acuity is presented to the observer, but 
a tone loud enough to activate the re- 
flex is presented before each impulsive 
noise. (The impulsive noises themselves 
arouse the reflex, but the muscles do not 
achieve full contraction before the im- 
pulsive noises terminate.) Such a pro- 
cedure has been found to reduce con- 
siderably the amount of temporary 
threshold shift (TTS) produced by the 
traumatic stimulus. The average reduc- 
tion was comparable to that achieved 
by the use of earplugs known to attenu- 
ate pure tones by amounts ranging from 
15 decibels at low frequencies to 45 deci- 
bels at high frequencies. An average 
figure, however, is not very meaningful, 
More TTS at low frequencies was noted 

when subjects wore earplugs and more 
TTS at high frequencies when observers 
were protected only by reflex elicitation. 

Hilding (1961) employed a closely re- 
lated technique. He demonstrated that 
losses in cochlear microphonic responses 
in cats, following exposure to impulsive 
noise, may be reduced or eliminated by 
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presenting a tone loud enough to activate 
the acoustic reflex before each of the 
impulsive sounds. While such a tech- 
nique has obvious advantages, it must 
be used with caution. Simmons (1960a) 
has shown that cochlear microphonic 
changes do not necessarily correlate with 
changes in neural sensitivity. 

There have been some recent attempts 
to correlate certain physiological and 
psychophysical measures of reflex action. 
Loeb and Fletcher (1961) measured and 
correlated contralateral threshold shift 
and TTSR for 16 observers. Correla- 
tions between these were found to be 
negligible. Mendelson and Fletcher 
(1962) attempted to determine relation- 
ships between these same psychophysical 
indices and manometric indices of acous- 
tic reflex action. In this instance some 
data suggestive of functional relation- 
ship between TTSR and the manometric 
techniques were obtained, but the rela- 
tionships were not simple. A recent arti- 
cle (Loeb & Fletcher, 1962) presents 
data which may explain these results. 
It describes experiments in which esti- 
mates of reliability of contralateral 
threshold shift were determined. Tt was 
found that while contralateral threshold 
shift in response to high frequency 
noise or clicks occurred quite regularly, 
the magnitude of the threshold shift 
varied somewhat, and reliability was 
surprisingly low (approximately .50-.60 
immediately after introduction of the ac- 
tivating stimulus with experienced sub- 
jects and lower in magnitude at later 
intervals or with inexperienced observ- 
ers). Reliability of TTS was somewhat 
higher (approximately .70-.80 immedi- 
ately after termination of the traumatic 
stimulus with experienced observers, 
lower at later intervals or with inexperi- 
enced observers). The TTSR reliability 
was not determined, but it might be ex- 
pected to be somewhat lower than that 
of TTS due to variability in the reflex 
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action itself. With reliability of this 
magnitude it is apparent that nonsignifi- 
cant correlations between the indices 
might very well be obtained. While 
similar information for manometric and 
acoustic impedance change techniques 
is not available in the form of reliability 
coefficients, it is known that there is 
considerable within-observer variability 
in reflex action (Mendelson & Fletcher, 
1962). It is also quite probable that a 
certain amount of variance is introduced 
in reading and interpreting the record- 
ings obtained by manometric and acous- 
tic impedance change methods. 


LIMITATIONS OF REFLEX PROTECTION 


It has already been mentioned that 
the latency of the acoustic reflex is such 
that little or no attenuation of very 
short, intense impulsive sounds (e-8., 
gunfire) occurs under normal circum- 
stances. Various values have been given 
for the latency of the reflex. Some of 
this variance is attributable to the fact 
that the tensor tympani and stapedius 
have different latencies and some to the 
fact that latency may be defined either 
as the time before contraction begins or 
the time before it is complete. Kobrak 
(1959) proposed that the term “latency” 
be applied to the former interval and 
the term “contraction time” to the latter, 
and this convention will be followed 
here, He states that the contraction 
time of the stapedius is approximately 
20 milliseconds and that of the tensor 
tympani 200 milliseconds. — Wersäll 
(1958) states that latency varies some- 
what with stimulus intensity, averaging 
approximately 7-10 milliseconds for 
stapedius and 12-16 milliseconds for 
tensor tympani. (These values are for 
cats under superficial phenobarbital 
anesthesia; rather similar values were 
obtained with rabbits.) Contraction 
time (three cats) ranged from 22 to 38 
milliseconds for stapedius and from 56 
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to 122 milliseconds for tensor tympani. 
All of Wersäll’s values were obtained 
by myographic techniques. Klockhoff 
(1961) reported somewhat higher 
values, but, as he employed an acoustic 
: pedance change procedure in record- 
ing activity and elicited the reflex by 
> Nea means, the values should 
ssarily be comparable. With 
manometric procedures (Mendelson 
1961) it is possible to distinguish sta- 
pedius and tensor tympani action be- 
cause they produce opposite movements 
of the eardrums, but if the responses 
tend to overlap it is apparent that an 
accurate indication of latency and con- 
traction time would be difficult to ob- 
tain. Cochlear microphonic experiments 
(Galambos & Rupert, 1959) have indi- 
cated reflex (presumably stapedius) la- 
tencies of the order of 10 milliseconds. 
(Estimates of contraction time were not 
given and are not easy to determine from 
the records presented.) In any event, 
though estimates of latency and contrac- 
tion time vary, it would seem that 
stapedius contraction requires at least 
10 milliseconds and tensor tympani con- 
traction a longer period. There is some 
evidence that the tensor tympani re- 
sponse is part of the startle reflex pat- 
tern (Klockhoff, 1961) rather than a 
true response to acoustic stimulation, 
and that it plays little role in reducing 
acoustic input to the cochlea (Galambos 
& Rupert, 1959; Simmons, 1959). If 
so, then the effective time of response 
can be taken as being between 10 and 
50 milliseconds. Since many impulsive 
sounds have rise times less than .1 milli- 
second and durations ranging from 10 
to 75 milliseconds (Doelling & Kryter, 
1959), the protection against impulsive 
noise trauma afforded by the reflex is 


minimal. i 

Contraction time for the middle ear 
muscle reflexes in man is still not defini- 
tively determined. In a very recent 
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investigation by Ward (1962b) employ- 
ing the TTSR technique it was shown 
that to produce maximum reduction of 
TTS the activating stimulus should pre- 
cede the traumatic stimulus by at least 
150 milliseconds. This result implies 
either that most protection is provided 
by the tensor tympani or that the sta- 
pedius is fully contracted only after 
150 milliseconds. Since the first inter- 
protection is at variance with the results 
of other findings (cited above), it ap- 
pears probable that the contraction time 
for the stapedius in man exceeds 150 
milliseconds, 

A second limitation of the reflex at- 
tenuation is its gradual adaptation to a 
continuously presented sound. The rate 
of adaptation apparently varies with fre- 
quency, being relatively abrupt for a 
high frequency tone and considerably 
less so for lower frequency tones, Adap- 
tation apparently begins immediately 
but may not be complete after 1.75 
minutes (Simmons, 1959). If the fre- 
quency of the reflex eliciting, acoustic 
stimulus is continuously changing (Sim- 
mons, 1959; Wersiill, 1958) or if the 
stimulus is constantly changing in ampli- 
tude like a thermal noise or 
clicks (Fletcher & Loeb, 1962; Ward, 
1961), the adaptation time may be very 
much lengthened. The Practical con- 
sequence of the adaptation Phenomenon 
is that reflex action not only is ineffec- 
tive against short impulsive stimuli but 
also against relatively Sustained stimuli, 
However, it is probable that some pro- 
tection is offered under most circum- 
stances, as animals with sectioned 
stapedius muscles show greater losses 
after exposure to continuous noise (Ga- 
lambos & Rupert, 1959). 

After termination of the activating 
stimulus, relaxation begins almost im- 
mediately, if the stimulus is at reflex 
threshold intensity, or shortly afterward, 
if the stimulus is considerably above 
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threshold (Weiss, Mundie, Cashin, & 
Shinabarger, 1962). However, studies 
employing acoustic impedance change 
(Metz, 1946), myographic technique 
(Wersäll, 1958), and cochlear micro- 
phonic reduction techniques (Galambos 
& Rupert, 1959) all indicate that some 
contraction may be present for a period 
of as much as 1 second, though the re- 
laxation period is generally shorter. 

In addition to the differential adapta- 
tion rate for high and low frequencies 
previously mentioned, there is also ap- 
parently a differential protection accord- 
ing to frequency (Simmons, 1959; 
Wever & Bray, 1937 ; Wever & Law- 
rence, 1954). In general, it is not en- 
tirely clear whether the difference is due 
to differential elicitation of the reflex 
by tones of different frequency or dif- 
ferential attenuation of tones of different 
frequency, though the Wever and Bray 
experiments suggest that the latter ex- 
planation is correct, (They found that 
when the tendons were pulled by ex- 
ternal weights, attenuation was greater 
for lower frequency tones.) In an ex- 
periment by Wever and Vernon (1956) 
tension along the line of the tensor tym- 
Pani was related to changes in cochlear 
microphonic. While some animals 
showed fairly good relationships be- 
tween tension and microphonic reduc- 
tion, others did not. The stapedius 
activity was apparently not measured to 
any considerable degree. In any event, 
Present data indicate that little protec- 
tion may be expected from the reflex at 
high frequencies, especially above 7,000 
cycles per second (Simmons, 1959). 
Whether this is true if the reflex is 
artificially excited remains an open ques- 
tion. 

The threshold for elicitation of the 
reflex is usually taken to be 70-90 deci- 
bels (Wersäll, 1958). Apparently the 
threshold is somewhat lower at higher 
frequencies (Simmons, 1960b; Wersäll, 
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1958), and it also varies with duration 
(Weiss, 1962). With repeated stimula- 
tion, posttetanic potentiation may occur 
and the reflex threshold may be lowered 
approximately 24 decibels (Simmons, 
1960b). It should be reiterated that at 
levels below the reflex threshold the 
muscles apparently maintain a state of 
contraction such that some sound at- 
tenuation is provided (Lawrence, 1960; 


Simmons, 1959). 
It is usually assumed that the acoustic 


reflex is completely consensual (Wever & 
Lawrence, 1954). Studies of nonacous- 
tic elicitation of the reflex indicate that 
reflexes so elicited may be completely 
homolateral (Klockhoff, 1961) or par- 
tially so (Pichler & Bornschein, 1957). 
There is also evidence that the reflex to 
acoustic stimulation may not be as great 
in the contralateral as in the homo- 
lateral ear. An experiment employing 
the acoustic impedance change tech- 
nique (Möller, 1961) and another em- 
ploying a loudness balance technique 
(Reger, 1960) yielded data best inter- 
preted in this fashion. 


NoNACOUSTIC ELICITATION 


It has long been known that contrac- 
tion of the middle ear muscles may be 
elicited by nonacoustic means. Wever 
and Lawrence (1954) reviewed the 
claims of several investigators who de- 
scribe voluntary contractions of the 
middle ear muscles. They suggested 
(citing Schrapinger) that some (not all) 
individuals who believe that they are 
contracting the tensor tympani are actu- 
ally contracting the veli palatini, a 
muscle of the palate attached to the 
Eustachian tube. Smith (1943) re- 
ported an individual with voluntary 
control of middle ear muscles and meas- 
ured changes of threshold during con- 
traction. It is interesting that the 
changes in loudness reported during con- 
traction were not necessarily correlated 


with threshold changes. Reger (1960) 
also reported individuals with voluntary 
control and described results somewhat 
similar to Smith’s. Reger’s findings pre- 
viously cited indicate that only 1-2% 
of the population has voluntary control 
of muscle movement. 

Since, through conditioning, a number 
of responses have been shown to be 
capable of elicitation by previously neu- 
tral stimuli, it seems at least possible 
that the intratympanic muscle response 
may also be so elicited. Simmons, Ga- 
lambos, and Rupert (1959) have in fact 
reported that if light flashes are repeat- 
edly paired with loud auditory stimuli 
the light flashes alone will eventually 
elicit the muscle response. Ward and 
Fleer (1961) on the other hand, found 
that there was no difference in TTS pro- 
duced by trains of high intensity clicks 
with periodic and aperiodic interclick in- 
tervals. This indicates that the reflex 
may not be temporally conditioned. 

A number of researchers have re- 
ported eliciting middle ear muscle re- 
flexes with cutaneous stimuli. Kato 
(1913) reported contractions in response 
to various types of aural and facial 
stimulation. Wersäll (1959) elicited 
muscle contraction by mechanical stimu- 
lation of the pinna and auditory canal. 
Klockhoff (1961) reported that electri- 
cal stimulation of the auditory canal 
produced a unilateral stapedius reflex, 
while tensor tympani action could be 
elicited only by an orbital air jet. The 
latter response was interpreted as being 
a component of a startle reaction. Pich- 


ler and Bornschein (1957) report a 10- 
15 decibel increase in the homolateral 
threshold, attributed to intratympanic 
muscle action, as a result of electrical 
stimulation of the external auditory 
canal. A small contralateral threshold 
shift (up to 5 decibels was also re- 
ported). Attenuation of lower frequen- 


cies was appreciably greater than that 
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for higher frequencies, and by anesthetic 
techniques the possibility of direct 
stimulation of the muscles at the inten- 
sities employed was ruled out. Djupes- 
land (1962) elicited homolateral _and 
contralateral stapedius reflexes (indi- 
cated by impedance change) and an 
occasional small tensor tympani response 
by aural air puffs. It is not certain 
whether the reflex was due to cutaneous 
or acoustic stimulation. 

Hugelin, Dumont, and Paillas (1960) 
report data indicating elicitation of re- 
sponse of the intratympanic muscles by 
action of the mesencephalic reticular 
system. This is discussed below. 


APPLICATIONS 


The most obvious application of the 
research reported is in construction of 
a new kind of ear protective device. 
Fletcher and Riopelle have applied for 
a patent on an apparatus which will 
present tones or other reflex eliciting 
stimuli a short period before the presen- 
tation of impulsive sounds. Such a de- 
vice would have advantages over other 
Protective devices—for example, ear- 
muffs or earplugs—in that no active 
cooperation on the part of the subject is 
necessary. (Anyone who has tried to 
elicit such cooperation will appreciate 
the importance of this factor.) It is 
probable that more effective reflex elicit- 
ing stimuli will be discovered. Chisman 
and Simon (1961) Suggest that reflex 
eliciting tones of gradual onset might be 
less injurious. However, Fletcher’s data 
(1961a) indicate that his activating 
tones with abrupt onset did not induce 
appreciable auditory fatigue and so pre- 

sumably are not injurious. In addition, 
there is evidence that tones of abrupt 
onset elicit the reflex more effectively 

Jeiss, 1962). s 

P resent indications are that the arti- 
ficial elicitation of the reflex will p- of 
practical value primarily for — 
against impulsive rather than continu- 
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ous traumatic noise. An attempt to re- 
duce TTS following exposure to loud 
continuous noise by presenting a mod- 
erately loud monaural tone during ex- 
posure to the noise was unsuccessful 
(Fletcher, 1961b). However, when either 
narrow band noise or a train of clicks 
was presented monaurally during a 25- 
minute exposure to broad band noise 
in the contralateral ear, the TTS pro- 
duced by the broad band noise was 
significantly reduced (Fletcher, 1961a). 
Whether protection of this kind would 
be noted following a longer exposure and 
whether a system incorporating the re- 
flex eliciting principle would be practical 
for long continuous-noise exposures is 
not known, but further exploration of 
these problems is planned. 

A second major application of the 
findings is the determination of a crj- 
terion for exposure to impulse noise. 
Rather widely accepted criteria for safe 
exposure to continuous noise already 
exist (Glorig, Ward, & Nixon, 1961), 
but comparable standards for exposure 
to impulsive noise are not available. In 
the past, it was generally assumed that 
as impulsive noises are more closely 
spaced, opportunity for recovery de- 
clines, and for the same number of im- 
pulses, the danger to hearing increases. 
However, if the impulses are spaced 
together closely enough, the reflex acti- 
vated by one impulse should reduce the 
inner ear input resulting from the suc- 
cessive one (Ward, 1962a). Carter and 
Ball (1962) have actually demonstrated 
such an effect. They find that intense 
impulses presented at 5 and 10 pulses 
per second are less damaging than the 
same number of impulses presented once 
per second. Similar results have been 
reported by investigators at the Army 
Human Engineering Laboratory at 
Aberdeen, Maryland.2 


?R. Donley and G. Smith, personal com- 
munication, 1961. 
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A third major application would be 
the determination of reflex excitability 
for use in clinical diagnosis. Klockhoft 
(1961) presented data indicating that if 
a stapedius reflex may be elicited, either 
through acoustical or electrical stimula- 
tion, some middle ear conduction must 
be present. Djupesland (1962) pre- 
sented data indicating that this prin- 
ciple may not be entirely general, but 
his method of reflex elicitation (aural 
air puff) differed from Klockhoff’s some- 
what. Absence of such a response would 
indicate conductive loss. If a tensor 
tympani response may be elicited (by 
orbital air puff stimulation) and the 
stapedius reflex may not, this is indica- 
tive of otosclerosis involving stapes 
ankylosis. Failure to elicit either reflex 
indicates a more extensive conductive 
loss. Failure to elicit the stapedius re- 
flex when auditory function appears 
normal and the tensor tympani response 
is readily elicitable may be taken as a 
diagnostic sign for facial nerve lesion 
(Jepsen, 1955; Klockhoff, 1961). 


IMPLICATIONS FOR RESEARCH 


It has been suggested that changes in 
remote masking (masking of low fre- 
quency sounds by high frequency 
sounds) may be attributable in part to 
acoustic reflex activation (Burgeat & 
Hirsh, 1961; Ward, 1961). Tt is proba- 
ble also that a great deal of the effect 
usually attributed to direct masking 
(masking of high frequencies by low) 
is in part due to acoustic reflex action 
rather than simply to overlap of acti- 
vated neurons. Tf so, the degree of mask- 
ing should change as a function of time, 
at least for steady masking sounds. 
Moreover, our perception of the relative 
loudness of sounds is in all likelihood 
influenced by acoustic reflex action 
(Reger, 1960, Ward, 1961). Reger sug- 
gests that the slope of the equal loud- 
ness contour for intense pure tones re- 


149 


flects this function. Loudness of pure 
tones might be expected to increase as 
a function of time due to reflex adapta- 
tion, but effects of this kind would be 
opposed by effects due to adaptation and 
fatigue. 

A number of researchers (e.g., Ga- 
lambos, Sheatz, & Vernier, 1956; Her- 
nández-Péon, Scherrer, & Jouvet, 1956) 
have demonstrated reduction in evoked 
potentials elicited by auditory stimuli at 
various central loci as a function of re- 
peated nonreinforced presentation or ex- 
posure to distracting stimuli as well as 
facilitation of such potentials following 
pairing with reinforcing stimuli. These 
phenomena have been explained on the 
basis of activity of the descending re- 
ticular activating system (Hernández- 
Péon, Scherrer, & Jouvet, 1956). 

Hugelin, Dumont, and Paillas (1960) 
report that reticular stimulation reduces 
dorsal cochlear nucleus potentials in 
“encéphale isole” cats. However, if the 
cats are curarized (thereby eliminating 
intratympanic muscle action) the effect 
is not obtained. They conclude, there- 
fore, that the physiological mechanism 
discussed involves reticular system con- 
trol of the intratympanic muscles. On 
the other hand, Mousheigan, Rupert, 
Marsh, & Galambos (1961) have shown 
that the habituation, conditioning, and 
distraction effects described may be re- 
corded in cats without middle ear mus- 
cles. The conflict of results may be more 
apparent than real. Desmedt (1960) 
has presented data indicating that de- 
scending pathways not in the reticular 
system may be responsible for centrifu- 
gal control of auditory input. If this is 
the case, then even if the reticular sys- 
tem does influence the action of the 
middle ear muscles, such action may not 
account in large measure for the periph- 
eral filtering of sensory input reported 
us investigators. However, the 


by vario 
possibility that the intratympanic mus- 


150 MICHEL LOEB 


cles may exert some such regulatory 
influence may not be ruled out at this 
S follows, further, that possible 
acoustic reflex action should be taken 
into account in other contexts. If sen- 
sory interaction experiments involving 
cutaneous facial stimulation and acous- 
tical stimulation are performed, possible 
elicitation of the reflex and resultant 
attenuation of the sound should be con- 
sidered. If a study of backward condi- 
tioning employs an air puff or an electric 
shock presented before a tone, it should 
be realized that the observer may, be- 
cause of reflex action, not be experienc- 
ing the tone at full intensity. Wherever 
an intense stimulus, especially a cutane- 
ous stimulus to the face or ear, is ap- 
plied concomitantly with or shortly pre- 
ceding an auditory stimulus, the possible 
effects of the reflex should be taken into 
account in interpreting findings. 
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NOTE ON NEED COMPLEMENTARITY 
IN MARRIAGE* 


GEORGE LEVINGER 
Western Reserve University 


This note reviews some conceptual and methodological issues concerning 
the need-complementarity hypothesis discussed in Tharp’s (1963) paper 
on psychological patterning in marriage. It is suggested that, in order 
to make a valid assessment of the hypothesis, it is necessary: to revise 
the accepted distinction between need complementarity and need simi- 


Jarity, 


to provide a more explicit theoretical basis for deciding which 


needs can be expected to be complementary, and to distinguish opera- 
tionally the different sources of need gratification. 


Tharp’s (1963) recent review of 
studies on the psychological patterning 
in marriage presents incisive observa- 
tions about knowledge in this area. His 
comments on hypotheses and on find- 
ings are extremely useful. 

It seems necessary, though, to sup- 
plement his discussion of Winch’s hy- 
pothesis on complementary needs in 
mate selection. To quote Tharp (1963), 
“Tn marriage research, no other hypothe- 
sis produced in the last decade has been 
as influential [p. 107].” Yet, after 3 
pages of methodological critique of 
Winch’s study, Tharp concludes “the 
complementary-need hypothesis as now 
stated is untenable [p. 110].” 

This writer agrees with Tharp’s gen- 
eral conclusion. Nevertheless, in order 
to understand why the evidence in favor 
of Winch’s hypothesis is thus far of 
dubious quality, it is important to go 
beyond pointing at methodological weak- 
nesses in existing studies. One must also 
examine the conceptual issues that are 
involved. 

Three issues to which little previous 
notice has been drawn are the following 


1The writer’s concern with these issues 
has arisen in the course of research on marital 
interaction, supported largely by Grant M- 
4653 from the National Institute of Mental 
Health, United States Public Health Service. 
Discussions with Barbara Allan, who is doing 
empirical research related to the hypothesis, 
have been helpful. 


ones. 1. There exists a logical confu~ 
sion in the accepted conceptual distinc- 
tion between complementarity and simi- 
larity of needs. 2. There exists no 
explicit theoretical basis for deciding 
which needs are complementary with 
which others, and existing studies are 
obscure in their basis of selecting such 
need pairs, 3. The conceptual distinc- 
tion between internal and external 
sources of need satisfaction has been 
ignored operationally in most research 
studies that ostensibly bear on the 
complementarity hypothesis.* 

The significance of the need-comple- 
mentarity hypothesis has been recog- 
nized not only by those interested in 
marriage and family relationships, but 
also by other social psychologists con- 
cerned with interpersonal behavior in 
general (e.g., Newcomb, 1956; Schutz, 
1958; Thibaut & Kelley, 1959). From 
a theoretical standpoint, it is only rea- 
sonable to expect that the more Person 
A wants to give what Person B wants 
to receive (and the more B wants to 
give what A wants to receive), the easier 
it will be for the pair to form and to 
maintain a viable relationship. The more 
that such conditions exist, the greater 
will be the mutual reward of the pair 
members and the lower their cost of 


2Some other important conceptual issues 
pertaining to need complementarity have been 
treated by Rosow (1957). 


153 


154 


achieving it (Thibaut & Kelley, 1959). 
1. The Distinction between Comple- 
mentarity and Similarity: Complemen- 
tarity may be defined as “mutually 
supplying each other’s lack,” or ey 
ing to fill out or complete” (Webster S, 
1946). Many of the most productive 
human relationships are complementary 
ones: male and female in the sex act, 
seller and buyer in the exchange, and 
so forth. In most examples that one can 
think of, the parts that “complete” the 
relationship are different in either 
quality or quantity. And this has given 
rise to the belief—either implicitly or 
explicitly expressed in the marriage 
literature—that complementarity and 
similarity are mutually exclusive at- 
tributes of interpersonal relationships. 
Let us examine Winch’s definition of 
complementarity. If two persons, A and 
B, are interacting, the resulting gratifi- 
cations of both are considered “comple- 
mentary” if one of the following condi- 


tions is satisfied (Winch, Ktsanes, & 
Ktsanes, 1954); 


Type I: “the need or needs in A which are 
being gratified are very different in intensity 
from the same needs in B which are also 
being gratified [p. 243].” 
Type II: “the need or ne 
being gratified are differen 
need or needs being grati 


eds in A which are 
t in kind from the 
fied in B [p. 243).” 

According to Winch and his col- 
leagues, if A is high on need Dominance 
and B low on this same need, Type I 
complementarity would occur, They 
also assert that Type II complemen- 
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tarity would arise where A is high on 
need Dominance and B high on need 
Deference or on need Abasement. 

The above definition presents some 
sufficient conditions for a complemen- 
tary relationship, but it omits other such 
conditions. Thereby, it leads one er- 
roneously to the conclusion that if A’s 
and B’s needs differ neither in kind nor 
in intensity, then the relationship is not 
complementary. 

Once apprehended, the error of this 
impression is clearly demonstrable. Let 
us assume the existence of a person, the 
perfect embodiment of Aristotle’s ethi- 
cal man, all of whose needs exist in per- 
fect moderation. His need scores, on 
whatever scale one might care to use, 
fall exactly at the midpoint of the con- 
tinuum. If Winch’s definition were 
adequate, one would have to conclude 
that none of this man’s needs could be 
complemented, since needs of different 
intensity would be either too high or too 
low. Such a conclusion seems untenable. 

More tenable is the idea that A’s and 
B’s needs, same in kind and equal in 
intensity, will complement one another 
Properly when both members of the pair 
possess the need in moderate quantity. 
Table 1 illustrates the point for Type I 
complementarity of the need for Domi- 
nance. 

The implications of this reformation 
are both conceptual and methodological. 
Regarding the former, it is advisable 
that this important hypothesis be se- 
curely anchored to a sound logical foot- 


TABLE 1 

COMPLEMENTARITY, SIMILARITY, AND DIFFERENCE RELATIONS IN A NEED PARING 
Amount of Amount of B’s need for Dominance 
A’s need for a — 
Dominance High Moderate iow 
High l Similar Different Complementary 
Moderate Different Complementary and similar Different 
Low Complementary Different 


Similar 
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ing. Until that is done, fruitful research 
on this subject would be rather difficult 
to recognize. 

Concerning the latter, one may briefly 
observe that the so-called negative find- 
ings in studies of the need-complemen- 
tarity hypothesis (e.g., Bowerman & 
Day, 1956; Katz, Glucksberg, & Krauss, 
1960; Schellenberg & Bee, 1960) cannot 
be considered negative merely on the 
basis of low correlations between 
partners’ needs of the same kind (ignor- 
ing for the moment the relation be- 
tween needs of a different kind). For 
any given need, if in general both part- 
ners should indicate it in moderate in- 
tensity, then the correlation between 
the partners’ amounts would be reduced 
artificially to zero, and yet there would 
be complementarity indeed. 

2. Instances of Hypothesized Com- 
plementarity: Whereas the first issue 
is a relatively simple matter of logic, 
the second is a more complex conceptual 
and empirical matter. It concerns the 
question of where the domain of com- 
plementarity exists. For what needs 
can either Type I or Type H comple- 


mentarity be postulated? 
Tt seems sensible to say that need for 


Dominance is a case of Type I comple- 
mentarity; that is, that two partners 
are better off when one of them needs 
to make influence attempts and the 
other to accept them. However, it is 
much harder to understand the grounds 
on which Winch et al. (1954) hypothe- 
size Type I complementarity for other 
needs, such as need Achievement; it is 
obscure why A and B would be better off 
when one is high and the other low on 
need Achievement, as opposed to both 
being either high or low. For a maximal 
coordination of their efforts, maximum 
similarity of achievement motivation 
might well be most advisable. 

It is also obscure how Winch et al. 
(1954) reason about Type II comple- 


mentarity. For example, they hy- 
pothesize that Abasement-Hostility and 
Abasement-Nurturance are both com- 
plementary need pairs (p. 246). Winch’s 
published work does not provide a clear 
theoretical rationale for enabling one to 
decide what kind of complementarity 
might be asserted for what kind of need 
under what conditions. 

A more explicit approach toward this 
problem occurs in Schutz’ (1958) more 
limited theory of need compatibility. 
Schutz confines his concern to three di- 
mensions of need—inclusion, control, 
affection—all specifically interpersonal 
needs, While confining himself purely 
to instances of Type I need comple- 
mentarity, Schutz distinguishes care- 
fully between A’s wishing to express 
behavior and wishing to receive behavior 
along any given dimension, Concern- 
ing need for Control, for example, A’s 
and B’s needs are defined as compatible 
to the degree that (a) A wishes to ex- 
press the same amount of control toward 
B that B wishes to receive, and also 
(b) to the degree that A wishes to re- 
ceive the amount that B wishes to ex- 
press. 

Schutz not only presents a useful defi- 
nition of several kinds of compatibility, 
but he also has offered some formulae for 
deriving meaningful scores on need com- 
patibility, together with operations in- 
dicated by FIRO-B (Schutz, 1958).° 
So far, only one study of mate selection 
has reported the use of Schutz’ instru- 
ments (Kerckhoff & Davis, 1962), but 
its results lend some support to a hy- 
pothesis of need complementarity. It 

was found that, for pairs of college stu- 
dents who had dated each other longer 

3 One difficulty in Schutz’ approach is that 


1 
the respondent’s general 
ituted, his 


ried couples; 
adequately than, say, 
Personal Preference Scl 
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than 18 months, couples with high need 
complementarity on Schutz’ dimensions 
made more “progress” in their courtship 
than couples low in such complemen- 
tarity. 

3. Internal versus External Sources 
of Need Gratification: The third issue 
refers to the source of satisfaction of 
any person’s needs. An implicit assump- 
tion in marriage studies is that need-fit 
pertains primarily to the marital rela- 
tionship itself. Yet marriage involves 
more than a personal relationship be- 
tween two individuals, The partners are 
simultaneously involved, both individ- 
ually and as a couple, in interactional 
situations in the external environment. 
There is a substitutability of sources of 
need gratification for each person’s 
needs. Proverbially, it is accepted that 
the man whose dominance needs are 
frustrated at the office comes home to 
release his anger on wife, child, or dog. 
Or, conversely, the henpecked husband 
at home goes out to work to boss his 
employees. It is by no means necessary 
that individuals use the same pattern of 
need gratification within and outside the 
marital relationship. 

Although Winch et al. ( 1954) men- 
tioned the distinction between need 
gratification “within the marriage” yer- 
sus that “ in all other situations,” this 
distinction is unclear in their own pub- 
lished data analyses (e.g., Winch, 1958), 
Furthermore, in the work of other re- 
searchers (Bowerman & Day, 1956; 
Blazer, 1963; Katz et al., 1960; Schel- 
lenberg & Bee, 1960), the distinction is 

almost entirely ignored. The latter stud- 
ies, in employing the Edwards Personal 
Preference Schedule (Edwards, 1953), 
used a need measure specifically de- 
signed to measure needs in the general 
environment of the college student’s 
peers. Their negative findings merely 
demonstrate, therefore, that marriage 
partners do not show any consistent cor- 


relation between such general needs. 
These studies can hardly claim to test 
the complementarity of these needs in 
the specific marriage situation. 

Parenthetically, it is interesting to 
note that when Katz et al. (1960) re- 
wrote the Edwards Personal Preference 
Schedule items describing general need 
for Nurturance and need for Succorance 
to pertain specifically to spouse rela- 
tions, they obtained their most signifi- 
cant findings for complementarity. Until 
investigators take account of this issue 
more directly in their measurement of 
specific needs, there will be no further 
evidence to support the original hy- 
pothesis by Winch and his colleagues. 

This note has attempted to illuminate 
certain additional issues concerning 
the need-complementarity hypothesis, 
omitted in Tharp’s (1963) review of 
marriage research, In this writer’s 
opinion, the hypothesis, when properly 
conceived and operationalized, does have 
a viability that gives promise for con- 
tinued study.* 


4 While this paper was in press, additional 
evidence concerning the effects of one kind of 
need complementarity on marital interaction 
has been published (Katz, Cohen, & Castig- 
lione, 1963). 
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REPLY TO LEVINGER’S NOTE 


ROLAND G. THARP 
University of Arizona 


Levinger’s extension of the complementary-needs (CN) hypothesis is 
assessed. Although superior to Previous CN formulations, it remains a 


vague statement of faith, 
support. 


lacking theoretical precision and empirical 
The study of needs—complementary or otherwise—is not 


likely to be fruitful in marriage research. Examples from group dynamics, 
statistical learning theory, operant theory, personality theory, and phe- 
nomenology all demonstrate a general movement in psychology away 
from the examination of stable traits and motives and toward the use 
of stimulus situations as Predictive constructs. Husbands and wives are 
spurred to action by stimuli which are not operative in their lives as 


bartenders and secretaries, 
which are specific to spousehood. 


Levinger’s “Note on Need Com- 
plementarity in Marriage” has per- 
formed the valuable service of clarifying 
the conceptual basis on which the social- 
psychological construct of need com- 
plementarity rests, His own reformula- 
tion takes account of more permutations 
than does Winch’s system, and he has 
demonstrated that those who would pair 
people on the basis of needs must take 
account of complementarity, similarity, 
difference, compatibility, etc. In addi- 
tion, he points out that no theoretical 
basis exists whereby needs may be 
paired, and we are agreed that attempted 
pairings have failed to Predict social- 
psychological events. 

Thus Levinger has brandished the 

knife of conceptual clarity, and in the 
process, committed euthanasia, For he 
has enabled us to see that a complemen- 
tary-need hypothesis can only state, in 
effect, that some combinations of some 
needs in some pattern of quantity and 
quality will eventuate in something sat- 
isfactory. And since there is as yet no 
convincing empirical support for any 
such combination, the CN hypothesis 
remains but a vague faith. 

This faith is, I believe, based on the 
common-sense assumption that some 
sorts of people combine into more last- 
ing, productive, and mutually satisfying 
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Marriage research must specify the stimuli 


groupings that would other people. Of 
course. That it should be otherwise is 
inconceivable. But should we assume 
that this matching is on the basis of 
motives? Levinger argues that the CN 
hypothesis is viable when needs specific 
to the marriage situation are under in- 
vestigation. Far from omitting this 
issue, my original paper (Tharp, 1963b) 
States that “assessment of needs not 
specific to marriage is clearly not 
the logical entree to predictive study 
[p. 108].” Levinger and I are agreed 
that the constructs under scrutiny must 
be role specific. 

The methodological ramifications of 
this position are as follows. The con- 
cept “role” pertains to expectations, per- 
ceptions, and actions specific to a partic- 
ular situation. Thus it is inappropriate, 
in social-psychological research, to con- 
tinue the very operations traditionally 
employed to assess needs. To abstract 
Tesponse tendencies to a wide variety of 
stimuli (whether questionnaire items of 
ambiguous pictures) necessarily brings @ 
loss of the precision required to predict 
to a specific role, 

But the more serious challenge to the 
CN hypothesis should be made on the 
basis of the constructs which it attempts 
to order. We must take account of 
specific stimulus Situations, rather than 
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attempt to predict from “stable” intra- 
organismic states. This is a development 
apparent in the entire enterprise of be- 
havior analysis. Let us consider several 
examples, the first from the area of 
social psychology, in which examination 
of stable traits has given way to a closer 
inspection of specific conditions (Cart- 
wright & Zander, 1960): 

The conception of leaders as people who 
possess certain distinctive traits has not proved 
to be satisfactory. A “new view” of leader- 
ship is emerging which stresses the perform- 
ance of needed functions and adaptability to 
changing situations. According to this con- 
ception, groups are (or should be) flexible in 
assigning leadership functions to various mem- 
bers as conditions change [p. 492]. 

As another example, in learning 
theory, drive state (motive, need) was 
once held accountable as a principal 
multiplicative constituent of predictive 
equations; or as the “organizer” of guid- 
ing internal stimulation sequences. Cur- 
rent statistical learning theory considers 
drive inducing operations to affect only 
the make-up of the stimulus set, the 
rate of sampling of stimulus set ele- 
ments, or the probability that any one 
element will be sampled on any single 
trial (Estes, 1959). 

The most radical statement of this 
movement toward stimuli is found, of 
course, in operant theory, where all 
internal states are extirpated from ex- 
planatory statements. Because of the 
organism’s reinforcement history, the 
appearance of a discriminative stimulus 
results in a response. Thus the stimulus 
itself is motivating. 

Other areas of psychological theory 
have profited from greater emphasis on 
stimulus situation, however, without 
abandoning their traditional concern 
with cognitions, perceptions, and ex- 
pectations. In personality theory, the 
classical psychoanalytic position of 
heavy concentration on instinctual drive 
as the determiner of the form of indi- 


vidual behavior is undergoing steady 
revision. The neo-Freudians, the ego 
psychologists, and more recently Schach- 
tel in his treatment of the allocentric 
mode of perception, have demonstrated 
that the most satisfying and produc- 
tive human experiences are not to be 
found in need-dominated behavior but 
in the appropriate transactions with 
fully apprehended objects and situations 
(Schachtel, 1959). 

As yet another example, consider the 

phenomenological approach to man; 
phenomenology would appear to be the 
antithesis of operant theory, yet the two 
are in basic agreement as to the neces- 
sity for greater concentration on stim- 
ulus conditions rather than on internal 
states (van den Berg, 1955): 
[The psychologist] never learns to know the 
subject better than by going to the objects, 
to the things of his world... the relation- 
ship of man to world is so profound, that it 
is an error to separate them .... Who wants 
to describe man should make an analysis of 
the “landscape” within which he demonstrates, 
explains and reveals himself [p. 32]. 

Let us then profit from the common 
experience of our colleagues. It is 
entirely conceivable that a man or 
woman is spurred to action as spouse by 
stimuli which are not operative in these 
individuals’ lives as bartender or secre- 
tary. The first order of business for 
marriage research is the specification of 
the stimuli: what is it to be a spouse? 
We must be able to describe and 
quantify the various expectations, re- 
ciprocations, rights, and obligations 
which comprise the situation of spouse- 
hood (Tharp, 1963a). Perhaps some 
facets of this role structure will turn out 
to be complementary; it is more likely 
that some will, and others will not. But 
before concerning ourselves with pat- 

terning of elements, we had best identify 
the elements themselves. 

These issues are no different in mar- 
riage research than in the investigation 
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of any social group. We will be closer to 
knowing why some people form better 
groups than other people, when we are 
sure what the groups do, and when, and 
how. 
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CONTEMPORARY PROBLEMS IN ADAPTATION- 
LEVEL THEORY: 


A SYMPOSIUM 


WILLIAM BEVAN 
Kansas State University 


The following three papers were pre- 
sented in a symposium on adaptation- 
level (AL) theory held during the 1962 
meetings of the American Psychological 
Association, A fourth by Harry Helson 
(1964), dealing with current theoretical 
issues, included material which was dis- 
cussed in his 1963 address as a recipient 
of a 1962 APA Distinguished Scientific 
Contribution Award. 

The 1962 symposium on AL theory is 
the second organized by the undersigned. 
The first, offered at the 1958 APA meet- 
ings, was concerned with demonstrating 
the wide range of phenomena amenable 
to conceptualization within the frame- 
work of this point of view. It consisted 
of five papers dealing with intermodal 
anchor effects, reinforcement, social atti- 
tudes and action situations, and judg- 
ment within clinical populations. This 
program was well received, and, in light 
of the consistent interest in the AL 
model, an increasing concern for prob- 
lems readily accommodated by this theo- 
retical approach, and a growing empirical 
literature related to this point of view, it 
seemed desirable that such a program be 
repeated after 4 years. 

The modern principle AL origi- 
nated in Helson’s experiments on color 
constancy and contrast. In these experi- 
ments, he made the important discovery 
that these two perceptual phenomena are 

` complementary and that it is possible to 


make accurate predictions of the judged 
color of a test patch from information on 
three aspects of the stimulus situation: 
the reflectance of the patch, the lightness 
of its background, and the hue of the 
ambient illumination. Extended to other 
psychophysical dimensions, the AL 
principle, in the general case, was demon- 
strated to reflect the integration of three 
classes of variable: the focal stimulus, 
background, and residual stimulation. 
When determinants representing these 
three classes are properly identified and 
controlled, accurate predictions have 
been made for an impressive array of 
phenomena: anchor effects, series ef- 
fects, the effects of interpolated stimuli, 
shifts in indifference point as a function 
of change in the statistical properties of 
the order of presentation, interdimen- 
sional and intermodal effects, the effects 
of subliminal stimulation, the effect of 
context upon judgment, contrast and 
assimilation, psychophysical bowing, the 
effect of presentation interval upon ap- 
parent magnitude, the role of criterial 
attributes in judgment, the influence of 
expectancies upon judgment, the effec- 
tive intensity of reinforcers, and reac- 
tion time and vigilance as a function of 
patterns of input. 

The 1962 symposium was planned as 
a two-by-two design where the two 
variables under consideration were 
breadth of theoretical application and 
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degree of empirical articulation. There- 
fore, it included two general theoretical 
papers and two papers dealing with 
specific experimental issues. The power 
of the theoretical model was explored 
within the context of two areas of in- 
quiry: that of traditional psychophysics 
and sensory psychology, and that of per- 
sonality research. The papers by Helson 
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and Parducci represented the first- 
named area and those of Goldstone and 
Goldfarb and Block the second. 
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SEQUENTIAL EFFECTS IN JUDGMENT’ 


ALLEN PARDUCCI 
Universiiy of California, Los Angeles 


The absolute judgment of a stimulus is often affecte 7 i 

order of presentation of the preceding stimuli. a T 
is anchored by the 2 extreme stimuli, dampening these sequential effects 
except when the range is extended or when conditions favor shifts in the 
remembered values of the extreme stimuli. An experimental situation was 


developed to create the latter conditions. 


Successive identifications 


were required for long, randomized sequences of the same 2 stimuli 
The sequential effects obtained in this situation were used to evaluate 
alternative adaptation-level (AL) models, involving the simplest inter- 
pretations of the theory and also special assumptions about anchoring 
by extreme stimuli. This work illustrates a direction for further develop- 


ment of the AL theory. 


Trial-to-trial variations in adaptation 
level (AL) have been generally neg- 
lected by those concerned with con- 
text effects in absolute judgment. Al- 
though it is clear that there are system- 
atic fluctuations in the scale of judgment, 
even after considerable exposure to the 
stimulus series, the observed shifts ap- 
pear to be relatively minor. One assumes 
that if these local shifts in judgment 
could be reliably assessed, they would be 
found to reflect the same principles which 
describe the adjustment to the series as 
a whole, 

This assumption is rooted in the folk- 
Jore of the psychology of judgment. For 
example, after classifying a set of 
weights into absolute categories, like 
“heavy” and “light,” even a naive ex- 
perimental subject may volunteer that 
the same weight felt heavier when it 
followed a lighter weight than when it 
followed a heavier weight. We call this 
successive contrast, and it fits in well 
with the AL approach to judgment. Such 
contrast has usually been demonstrated 
with comparative judgments, as when 


1 Presented at the annual meeting of the 
American Psychological Association, St. Louis, 
5 September 1962. Research performed by 
Arthur Sandusky, Judith Brown, and Weyman 
Reams, and supported by a grant (M 5465) 
from the National Institutes of Health. 


an unjudged weight is interpolated be- 
tween the standard and comparison 
weight (e.g., Guilford & Park, 1931). 
The trouble is that the absolute judg- 
ments of our retrospective subject are 
not likely to show this kind of contrast. 
Insofar as his judgments actually vary 
as a function of the preceding stimulus, 
he is more likely to say “heavy” follow- 
ing a heavier preceding stimulus—as- 
similation. 
This kind of assimilation is rarely a 
very dramatic phenomenon; and even 
where clearly demonstrable (e.g., Garner, 
1953; Sherif, Taub, & Hovland, 1958), 
it can be assimilated into an AL ap- 
proach to judgment (Parducci & Mar- 
shall, 1962). But it is a problem, one 
which we have been happy to ignore in 
our efforts to describe the “bigger” phe- 
nomena of judgment. Thus, we typi- 
cally get ourselves a more reliable de- 
pendent variable by averaging all the 
judgments made to each stimulus— 
averaging across different stimulus se- 
quences, hoping that sequential effects 
will balance out. This may have been 
sound research strategy since sequential 
effects are relatively small and since this 
averaging has enabled us to map out 
various features of the stimulus context 
from which one can predict the overall 
163 
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scale of judgment (Helson, 1959; John- 
son, 1955). f 

However, we might now show a bit 
more deference to common sense and to 
our subjects phenomenology. We al- 
ready know that there are large trial-to- 
trial effects. The initial adjustment to 
the stimulus context often reflects quite 
marked shifts in AL, and similar ad- 
justment may occur repeatedly through- 
out the series of presentations. Why 
not submit these shifts in judgment to 
experimental analysis? The payoff may 
be a better understanding of the grosser 
phenomena which we obtain by averag- 
ing the trial-to-trial effects. 


EFFECT or EXTREME STIMULI 


Recent psychophysical research sug- 

gests the special importance of one of 
these grosser phenomena, the influence 
of the end values, the stimulus extremes, 
as determinants of absolute judgment 
(Parducci, 1963). There can no longer 
be much doubt that these values require 
special weighting in the AL equations. 
For example, an increase in the extreme 
upper value produces immediate down- 
ward shifts in judgment of the other 
stimuli in the series, The AL moves up 
toward the physical value of this new 
stimulus, However, there is often little 
change in judgment following subsequent 
withdrawal of the new extreme stimulus 
from the series. The AL does not very 
quickly return toward its normal level 
for the restricted series (Parducci, 
1956). This “direction-of-shift effect” 
appears consistent with the special role 
of the stimulus extremes. To some ex- 
tent, the subject uses his categories to 
subdivide the range of stimuli into 
proportionate subranges; and even after 
many trials in which the stimulus ex- 
tremes have not been presented, they 
still anchor his scale of judgment. 

The relative permanence of this end- 
anchoring in simple laboratory situa- 
tions may tend to obscure trial-to-trial 
changes in AL. It is as though the two 


ALLEN PARDUCCI 


extreme stimuli were constantly present 
as standards against which each of the 
successive stimuli are compared. The 
changes in the judgments of the dif- 
ferent stimuli are relatively small be- 
cause they are always being compared 
with the same standards. 

Detailed analysis of the initial judg- 
ments of a series should show large, 
systematic shifts in judgment following 
each presentation of stimuli with values 
falling outside the range of the previous 
presentations. However, since adjust- 
ments occur so quickly to extensions of 
the range, there would be a sparsity of 
data for detailed analysis. Furthermore, 
adjustments occasioned by the introduc- 
tion of new values might differ markedly 
from sequential effects occurring late 
in a well-practiced series, 

Systematic shifts in the remembered 
values, the traditional points of sub- 
jective equality (PSEs), for the ex- 
treme stimuli would provide one of the 
possible bases for sequential effects 
which occur later in the series. Insofar 
as the presentations of the series stimuli 
affect the PSEs for the extreme stimuli, 
just as interpolated and comparison 
stimuli affect the PSE for the standard 
in comparative judgment, sequential ef- 
fects would occur throughout the series. 
The first experimental problem is to find 
a situation for which the remembered 
values of the stimulus extremes would 
be relatively unstable. An experimental 
situation meeting this requirement would 
also encourage the operation of other 
habits of judgment. The tendency for 
subjects to use their alternative cate- 
gories of judgment with fixed relative 
frequencies should have greater influence 
when the anchoring of the extreme 
values is made less effective (Parducci, 
1963). Thus, a run of presentations 
from one portion of the range of stimuli 
should produce shifts in judgment 
away from the categories previously ap” 
plied to that portion of the range. 
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AN EXPERIMENTAL SITUATION 


The results to be described here were 
obtained using a number of different 
stimulus dimensions, but always with 
the same general experimental situation. 
In accordance with the presumed anal- 
ogy to changes in PSE for compara- 
tive judgment, the situation used a rela- 
tively small range of stimulus values so 
that even small changes in the remem- 
bered values of the stimulus extremes 
would markedly affect the scale of judg- 
ment, The subject is told, correctly, 
that there are two stimuli, for example, 
a light weight (100 grams) and a heavy 
weight (120 grams). The stimuli are 
presented one at a time over a series of 
300 trials. On each trial, the subject 
must identify which of the two stimuli 
(i.e., the lighter or the heavier weight) 
has been presented. He is given no feed- 
back concerning the correctness of his 
judgments. Subjects under different con- 
ditions receive the two weights with dif- 
ferent probabilities (e.g., the heavier 
weight may come on 80% of the trials, 
or on 50%, or on only 20% of the 
trials). In our efforts to find a suitable 
experimental situation, we have tried 
(besides lifted weights) lights varying 
in brightness, tones varying either in 
loudness or in pitch, and, most recently 
and extensively, two lights which vary 
in spatial position—the subject having 
to judge whether, in a darkened room, a 
small light has flashed in the left or in 
the right of the two possible positions 
(a third, fixed light being located far 
above these two variable lights). 


ALTERNATIVE MODELS 


Consider some alternative models for 
judgment in this situation. The simplest 
AL model, consistent with Helson’s 
weighted-mean approach, might treat 
AL on any trial as the mean of all the 
stimulus presentations up to and includ- 
ing that trial, the probability of a cor- 
rect judgment being a linear function 


of the difference between the value of 
the presented stimulus and AL on that 
trial: 


_n-l 1 
n=- ALgat 7 [1] 


P(Jn=1)=K(Sy—-ALn) +C [2] 


where AL, is the adaptation level on 
Trial 7, S, is the stimulus value (e.g., 1 
or 0) on Trial z, P(J,=1) is the proba- 
bility that the judgment on Trial x is 
“heavier” (or “brighter” or “left”), and 
K and C are empirical constants repre- 
senting the discriminability of the stim- 
uli and the response bias, respectively. 
Since the influence of S,, upon AL, varies 
inversely with 7, this model entails that 
the probability of a correct response 
should become increasingly independent 
of features of the stimulus sequence 
(e.g., the length of homogeneous runs) 
as the number of trials increases. This 
has been the general observation for ex- 
periments on absolute judgment, the sys- 
tematic changes in the judgment of each 
stimulus being small after an initial 
adjustment to the series. Although this 
model appears consistent with a fair 
body of research, our data from the pres- 
ent situation show little difference be- 
tween the sequential effects found in the 
first versus the second 150 trials, even 
though fairly marked _ trial-to-trial 
changes appear throughout the series. 

Another AL model * seems more con- 
sistent with certain developments in Hel- 
son’s thinking (e.g., 1948). This second 
model also assumes that AL on each trial 
is a weighted average of AL on the 
preceding trial and the stimulus occur- 
ring on the present trial. But for this 
model, the weighting is the same for 
every trial: 

AL, =aAL,a+(1-@)Sn [3] 


2First presented by Seward (1961) for 
describing the effects of incentive magnitudes, 
and independently developed for perceptual 
judgments by N. H. Anderson (unpublished 


manuscript). 
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where a is an empirical constant which 
remains constant over trials. The law 
of judgment is the same as for the first 
model (Equation 2). Thus, AL shifts 
toward each successive stimulus, moving 
a constant proportion (independent of 
trial number) of the difference between 
where it was and the value of the pre- 
sented stimulus, Unlike the first model, 
this one does not necessarily entail 
changes between first-half and second- 
half sequential effects (if the rate of 
change is large). It does, however, en- 
tail that AL approximates the mean of 
the stimuli with long, randomized series 
—in this sense explaining this AL phe- 
nomenon obtained by averaging over 
the entire series. It also predicts that 
the probability of a correct response will 
decrease as a function of the length 
of a homogeneous run (e.g., a series of 
presentations of the lighter weight). 
That is, AL approaches the value of the 
repeated stimulus, decreasing the dif- 
ference between AL and that stimulus. 
So the likelihood of a correct response 
should decrease. And this is what hap- 
pens. Subjects are much more likely to 
be correct on alternation than on repeti- 
tion trials, the difference being on the 
order of 75% correct when the stimulus 
shifts as compared with only 60% cor- 
rect on trials when the stimulus repeats 
(for the degree of stimulus variation 
used with several different physical di- 
mensions studied for the present situa- 
tion). 

Another prediction from this model is 
that the probability of being correct on 
the stimulus breaking a homogeneous 
run (i.e., on alternation trials) will in- 
crease as a function of the length of the 
broken run. The AL should move to- 
ward the repeated stimulus so that the 
difference between AL and the stimu- 
lus which finally breaks the run will 
get progressively larger—increasing = 
probability of a correct response when 
the stimulus run is finally broken. This 
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model fails here. For every dimension 
studied, the probability of a correct 
response appears to be independent of 
the length of the prior run on the alter- 
nate stimulus. 

A third derivation (from both of these 
AL models) is that the probability of 
each response (e.g., “brighter’’) is inde- 
pendent of the overall probability of 
presentation of the associated stimulus. 
This is an interesting prediction since 
the stimulus values yield correct dis- 
crimination on more than two thirds of 
the trials for a 50% schedule. One 
might have expected, for example, more 
“brighter” judgments when the brighter 
light was presented on 80% of the 
trials (as compared with conditions for 
which it was presented on only 50% 
or 20% of the trials), Although con- 
trary to common sense, this prediction 
seems central to the general approach 
taken by AL theory. The initial re- 
sults were most intriguing, the mean 
percentage of “brighter” responses, for 
example, being within one percentage 
point for all three schedules (56%). 
However, subsequent work using judg- 
ments of two different positions of lights 
indicates that this prediction does not 
always hold, the probability of a “left” 
response shifting directly with the prob- 
ability of the light flashing in the left 
position (.53, .50, and 47, for the .80, 
-50, and .20 schedules, respectively). 

A third model seems to handle the 
data better than either of the two al- 
teady described. In accordance with 
the special role of the extreme stimuli, 
this model assumes that probability of a 
Correct response is a linear function of 
the difference between the PSEs or re- 
membered values of the two stimuli 
which are the end points of the stimulus 
distribution (there being just the two 
stimuli in the distribution), The model 
also assumes that the PSE for the 
presented stimulus is always the same 
value, the physical value of the presented 
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stimulus (e.g., 120 grams), and that 
the PSE for the stimulus not presented 
is a weighted average of its value on the 
preceding trial and the value of the 
presented stimulus: 


Sn =A5y-1+ (1—-@) Sp [4] 
P(Jn=1)=K(Sn-5n)+C [5] 


where s, is the PSE for the stimulus 
not presented on Trial #. This end-point 
model does better than the AL models 
since it not only predicts that the proba- 
bility of a correct response decreases with 
increase in the length of a homogeneous 
stimulus run, but also that the probabil- 
ity of a correct response to a stimulus 
alternation is independent of the length 
of the prior run. Furthermore, it cor- 
rectly predicts that the probability of 
one of the responses (e.g., “left”) will 
sometimes vary for different stimulus 
probabilities—depending upon the rate 
of change of PSE. 

Some caution must be expressed with 
respect to the adequacy of this simple 
end-point model. The declining function 
which represents the fall in the prob- 
ability of a correct response on homo- 
geneous stimulus runs flattens out on the 
second repetition and sometimes even 
rises—indicating that there may be an 
alternation bias in the subject’s re- 
sponses. While there is a general tend- 
ency (ie., in other situations) for judg- 
ments to be equally distributed over the 
different response categories, certain 
guessing habits may be specific to this 
simple situation. 

In spite of these reservations, it is 
clear that this situation yields marked 
trial-to-trial effects, systematic for each 
subject and also across subjects, and 
that these effects are associated with 
particular stimulus sequences. We thus 
have the opportunity to analyze the basic 
nature of the shifts in frame of reference 
which occur with each successive stimu- 
lus presentation. Why are the sequential 
effects so much greater here than for 


167 


the more complex experimental situa- 
tions with which absolute judgments are 
customarily investigated? Perhaps the 
use of end points which are themselves 
difficult to discriminate—only about one 
difference limen apart—may be crucial. 
However, the sequential effects for 
everyday, nonlaboratory judgments, with 
greatly separated end points, sometimes 
seem to be of much greater magnitude. 
Of course, such extreme experiences may 
be widely spaced in everyday life. The 
fact that we can get these trial-to-trial 
changes in a single laboratory session 
suggests that further investigation of 
sequential effects may provide a basis 
for significant developments in the AL 


approach to judgment. 
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A CONCEPTUAL FRAMEWORK FOR THE 
CLINICAL TEST SITUATION * 


WILLIAM E. BLOCK 
Yeshiva University 


The need to account for the adaptive behavior of the patient in the 
clinical test situation is discussed with respect to the relative rather than 


absolute nature of test responses. 


Adaptation-level (AL) theory is 


proposed as a framework of reference for adaptive behavior of the 
patient in the clinical test situation, as well as for clinical judgment and 
prediction. The theory of AL possesses the relativity and operational 
validity required for dealing with complex clinical phenomena. Implica- 
tions of the AL model for clinical theory, experimentation, and prac- 


tice are suggested. 


The use of projective techniques is 
one of the main approaches of the clini- 
cal psychologist in understanding the 
patient before him. The clinical test 
situation is a complex interaction of a 
patient with a past history, hopes and 
fears, and with varying motivations, who 
must adapt to a projective task char- 
acterized by more or less ambiguous 
stimuli (the ambiguity of which varies 
with patient expectations) administered 
in an interpersonal context by a skilled 
examiner whose idiosyncrasies, in turn, 
may influence the patient, Hence, the 
patient’s responses can only be regarded 
as relative to the totality of influences 
operating in this complex situation. The 
necessity arises, therefore, for a theo- 
retical framework broad enough to en- 
compass the effects of focal, residual, 
and contextual factors upon test be- 
havior. This article discusses the pos- 
sible application of the adaptation-level 
(AL) paradigm to problems of the clini- 
cal test situation. 


1 This paper was prepared under National 
Institutes of Health Grant M-6101, and is an 
extension of a study reported at a symposium 
on AL theory held at the 1962 meeting of the 
American Psychological Association in St, 
Louis. The title of the symposium paper was: 
“A Preliminary Study of Adaptation-Level 
Theory as a Framework for Projective Test- 
ing” (National Institutes of Health, M-5118A). 
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CLASSICAL VIEW OF THE CLINICAL TEST 
SITUATION 


The patient’s ostensible freedom from 
inhibitions of propriety, self-censorship, 
and right-wrong set in confronting the 
ambiguous task represented by projec- 
tive stimuli has been regarded as among 
the chief virtues of this type of test 
compared with, say, the objective per- 
sonality inventory in revealing deeper 
aspects of personality, hidden motiva- 
tions, etc. The response elicited by the 
projective technique is viewed as a 
“projection” of personality in a man- 
ner analogous to the way the hidden 
images of a film strip are brought to life 
by projection on a screen; another com- 
mon analog views the projective test as 
an X ray of underlying personality struc- 
ture. Protocol variables of main in- 
terest to the clinician have been those 
linked to “task performance,” such as 
responses, scores, and content, with little 
or no concern about another domain of 
variables centered on “task approach” 
or “adaptation.” 

Projective techniques are largely im- 
bued with face validity and the result- 
ing protocol taken as indicative of 
absolute traits of the patient, Other 
working assumptions about clinical test 
ing are the monolithic nature of the test 
situation, that is, its freedom from €% 
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traneous influences, the validity of the 
concept of “projection,” the inferences 
drawn from responses, and the simple 
addition of various tests to comprise a 
“battery.” 


CRITIQUE OF THE CLINICAL TEST 
SITUATION 


Despite consensus among clinicians 
of what transpires in the test situation, 
a number of unresolved questions emerge 
about these underlying assumptions. 

1. Projection is the accepted explana- 
tion of the process by which a subject 
produces a response to the ambiguous 
test features. Yet projection is descrip- 
tive rather than explanatory in the 
scientific sense. While it is a rich con- 
cept, the what, how, and why of projec- 
tion remain moot questions. Projection 
also involves a paradox—the notion of 
an unconscious illogical psyche making a 
logical cognitive effort to comprehend 
the projective task. 

2. Aside from the problems inherent 
in obtaining a test protocol, discussed 
below, the meaning of the protocol poses 
another problem. Not only may inter- 
pretative approaches vary within a given 
technique, but also from one technique 
to another. Interpretation may involve 
formal, content, dimensional, and holis- 
tic aspects. T hough all projective tech- 
niques are thought implicitly to involve 
projection, articulation of data for pur- 
poses of interpretation from various tests 
becomes an artistic, intuitive enterprise 
because of their unique metalanguages. 
Hence, the clinician more often than not 
utilizes dubious overarching conceptual 
canopies, procedurally rather than em- 
pirically validated (Block, 1962b), to 
encompass the variegated data of a 
multitest battery. 

3. Moreover, many projective tech- 
niques involve a priori assumptions 
about the meaning or significance of cer- 
tain responses as well as their interrela- 


169 


tionships without operationally anchor- 
ing either the basic concepts or rules of 
interpretation. 

4. Until recently, it was assumed in 
the classical measurement tradition of 
Galton that a test sampled a subject’s 
response to invariant stimuli in a stand- 
ard invariant situation administered by 
an inscrutable examiner. This mono- 
lithic view of the clinical test situation 
essentially holds that a sample at one 
time is representative of the patient on 
all other occasions. 

This view becomes untenable in the 
light of increased sophistication about 
the nature of psychological testing and 
a growing number of empirical studies 
(Masling, 1960) showing that the sub- 
ject’s responses are influenced not only 
by his personal needs, both residual and 
transient, but also by his appraisal of 
such situational variables confronting 
him as the mode of administration, test- 
ing milieu, and characteristics of the 
examiner. 

The gist of this evidence is that the 
subject’s response is a relative one tem- 
pered by adaptation to the here-now 
test situation, And though clinicians are 
becoming more cognizant of background, 
focal, and contextual effects in testing, 
there has been little effort made on their 
part to integrate adaptive behavior into 
the overall personality description of 
the patient. The fault does not lie with 
clinicians but with the inadequacies of 
test theories at hand. Projective psy- 
chology, as a body of theory, can neither 
take into account the patient’s adapta- 
tions nor make use of this important 
facet of behavior. 

5. Another questionable assumption 
is the objectivity of the examiner. It 
had been previously thought sufficient 
merely to provide the clinician with a 
set of rules of interpretation or princi- 
ples to insure objective scrutiny of pro- 
tocols. The evidence is that the clini- 
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cian’s personality does enter into his 
judgments of test data. Yet no way is 
known to mitigate this bias since there is 
no clear understanding of how judgments 
are formed in the first place. 

6. The absolutist tradition of testing 
assumed that the addition of stimuli (or 
tests) merely increased the range and 
variety of responses, and that each test 
was independent of others in the total 
battery. The fact that interactional 
effects occur between other components 
of the clinical triad (test-testee-tester) 
suggests the likelihood of interactions 
among the test stimuli vis-a-vis the 
patient as well. 

7. Most research on projective test- 
ing is focused on what is measured to 
the exclusion of the more fundamental 
question: How does the projective tech- 
nique qua measuring instrument elicit 
data? As the author has suggested else- 
where (Block, 1962b), projective tech- 
niques are similar to psychometric ex- 
periments albeit with unexplicated ra- 
tionale. As such, they are susceptible to 
the same class of errors and judgment 
biases that pervade any measurement 
effort. Once again, projective psychol- 
ogy is found inadequate to deal with 
these psychometric phenomena, 

The above critique implies a cogent 
need for a theoretical framework with 
construct validity to account for the 
diverse adaptive phenomena that oper- 
ate in the clinical test situation. The 
author would now like to state the case 
for AL theory in meeting this need 
based upon theoretical considerations 
and preliminary experimental findings, 


AL THEORY AS A FRAMEWORK FOR THE 
CLINICAL Test SITUATION 


The concept of AL, as formulated by 
Helson (1948, 1959), provides a frame- 
work within which the variables of the 
clinical test situation may be evaluated 
on a quantifiable, operational basis. De- 
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fined operationally, AL is the stimulus 
that evokes a neutral or indifferent re- 
sponse by the subject. Since the struc- 
ture of the behavioral field is fixed by 
the position of the neutral point, the 
frame of reference is completely defined 
once the value of AL is established. The 
AL is the pooled effect of stimuli inter- 
acting with each other (weighted-log 
mean). The existence of a neutral re- 
gion (AL) is important because it acts as 
a frame of reference for behavior and 
determines what reactions will be made 
to the specific stimuli in any behavioral 
situation. The AL also involves tem- 
poral pooling of residual, contextual, 
and focal stimuli, By virtue of its math- 
ematization, AL theory permits calcu- 
lation of the relative contribution of 
these stimulus sources to a given re- 
sponse or judgment. 

Thus AL theory deals with the here- 
now adaptive behavior of the individual 
taking into account past experiences, 
present needs, and future anticipations. 
These dimensions of past-present-future 
characterize the temporal aspects of 
normal existence. The theory of AL can 
deal with those aspects of behavior 
which convey a sense of stability to 
the individual by harmonizing inner 
needs with the outer demands aimed at 
establishing a stable, yet dynamically 
responsive, point of reference for be- 
havior. 

Being operationally defined, AL the- 
ory is a rigorous scientific system un- 
like the quasi-mystical concepts of pro- 
jective psychology. Moreover, since the 
theory deals with adaptive behavior, 
it satisfies the requirement of psycho- 
logical relevance too. And of special 
importance for the esteemed “subjec- 
tivity” of the clinical test situation, AL 
theory allows for phenomenological and 
existential aspects of behavior. 

Starting from simple psychophysical 
experiments, Helson and others have 
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systematically shown the wide applica- 
bility of AL theory to other psycho- 
logical domains such as vision, learning, 
social psychology, traumatic and neu- 
rotic behavior. Helson (1959) has postu- 
lated the extension of AL theory to 
psychological testing in this statement: 
conceptually responses to ink blots or test items 
in personality scales are no different from psy- 
chophysical judgment in being influenced by 
situational factors and momentary adjustment 
level of the organism [p. 605]. 


Murstein (1959) utilized the AL frame- 
work in analyzing stimulus research on 
the Thematic Apperception Test, though 
not demonstrating its application ex- 
perimentally. 

Block (1962a) conducted a prelimi- 
nary study to test the congruence be- 
tween AL theory and projective testing. 
Twenty subjects rated the Rorschach 
inkblots on the evaluative scale of the 
semantic differential following the usual 
free response phase. Factor loadings 
of each card on the evaluative scale 
were used as their objective stimulus 
value. The AL corresponding to the 
neutral point of pooled ratings was com- 
puted and then original mean judgments 
were postdicted on the assumption that 
such judgments had taken place with 
respect to a framework of reference de- 
termined by the prevailing AL. An F 
test of “goodness of fit” between ex- 
perimental and theoretical values was 
significant beyond the .01 level of con- 
fidence; this finding was cross-validated 
on another sample of 13 subjects. It was 
therefore possible to apply the AL 
paradigm not merely analogically, but 
by means of general quantitative, func- 
tional relations established between stim- 
ulus and behavioral variables in a pro- 


jective testing situation. 
BEHAVIORAL Dimensions or AL 


A paramount value of the mathemati- 
zation of functions is the possibility of 
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manipulating symbolic concepts to re- 
veal new dimensions of understanding 
and insight. This has been done in some 
AL formulations. 

. First, AL may be described as a pool- 
ing or interaction of effects. It is the 
end result of averaging processes in the 
organism, On the assumption of iso- 
morphism between behavior and its 
mathematical expression, Helson (1959) 
infers that AL exemplifies psychological 
integration, inasmuch as averaging 
operations mathematically are special 
cases of integration. This implies that 
the subject’s confrontation with the 
clinical test situation, as measured in 
AL terms, is an index of degree of 
psychological integration at that mo- 
ment. 

As a corollary of this integrative view, 
the manner in which residual (past ex- 
perience), focal (present needs), and 
contextual (milieu) effects interact tells 
something about the individual’s han- 
dling of the adaptive task he confronts 
in the test situation. This may be illus- 
trated by the mathematical definition for 
AL where there is pooling of existing 
levels of stimulation (Helson, 1959): 


AL=AL,™ AL" Sy 


where AL stands for adaptation level, 
S denotes the geometric mean of projec- 
tive stimuli; the subscript 7 for residual, 
c for contextual, f for focal; and expo- 
nents m, n, and e are the respective 
weighting coefficients of the various stim- 
ulus sources contributing to AL. Michels 
and Helson (1949) have suggested the 
following psychological equivalents of 
the weighting exponents which we shall 
reinterpret for the projective test situa- 
tion: 

m—this is the contribution of past experience, 
and the fact that it must be taken into account 
in adapting to any new situation or set of facts 
makes it psychologically the equivalent of re- 


sistance in the test situation. Such a meaning 
would be most useful perhaps in assessing the 
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subject’s openness to therapy or new experi- 
ences via test adaptation. 

i ibuti f the sur- 
—this represents the contribution o s 
eer test milieu, be it human, physical, 
social, etc., in short, the relativity of Tesponses 
to the prevailing situation. Psychologically, it 
may best be described as a measure of the sub- 
ject’s distractibility from the projective task 
at hand. 


e—this exponent represents the contribution to 
AL of the subject’s inner needs, their per- 
sistence and strength of manifestation in the 
face of competing residual and contextual in- 
fluences of the test situation. In a sense it 
represents self-adaptation, how one confronts 
inner tensions; its behavioral counterpart is 
sensitivity to one’s own needs in the face of 
external demands. 


Recapitulating, we may then say that 
adaptive test behavior integrates past 
experience, present needs, and back- 
ground influences. The important im- 
plication here is that so-called fixed per- 
sonality traits sought by clinicians are 
relative to the test situation in which 
they are called forth. The practice of 
searching for absolute traits as such is 
questionable on theoretical and empirical 
grounds. One may only hope to find or 
show relative constancy between given 
stimuli and responses despite the shift of 
the zero of functioning (AL) under 
changing conditions (Helson, 1959). 


AL THEORY as A MODEL or CLINICAL 
JUDGMENT AND PREDICTION 


The clinical psychologist appears to 
engage in two judgmental processes with 
respect to the data confronting him, 
whether test, interview, or observations. 
The first type of judgment involves de- 
scription, or classification of data; the 
second, more complex type, is the predic- 
tion of future behavior on the basis of 

present data. 

How clinical judgments are formed 
has been scrutinized carefully by a num- 
ber of researchers, one group of which 
is inclined to relegate clinical decision 
making to the realm of the artistic and 
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other such realms, implying refractori- 
ness to scientific explanation; while the 
opposing group views clinical judgments 
as rational and scientifically verifiable. 
The latter have increasingly used math- 
ematical models to describe clinical 
judgments. Hoffman (1960), for ex- 
ample, proposes both linear-additive 
and configurational-interactive models, 
while Bakan * offers a probability model. 

In the present author’s opinion, such 
mathematical models suffer by being 
paramorphic; that is, not really tied to 
the underlying processes purportedly 
represented. Contrasted with these, AL 
theory offers an isomorphic model, one 
having construct validity because of its 
verifiable relationships between stimuli 
and responses. Moreover the AL model 
embraces both interactive and proba- 
bilistic aspects of clinical judgment. 

As to the first type of judgment— 
classification or description—the general 
paradigm of AL theory for psychophysi- 
cal judgments would seem to apply; for 
in principle, any set of test responses or 
qualitative data that can be scaled and 
dichotomized is amenable to representa- 
tion in AL terms. Focal, contextual, 
and residual factors will be contribu- 
tory to the overall judgment for the 
given patient. 

The theory of AL applied to clinical 
prediction is more complex and may in- 
volve what Smedslund (quoted by 
Helson, 1959) defined as probability- 
adaptation; that is, the level of proba- 
bility expectation that is affectively 
neutral to the person at a given time 
and which is represented by the weighted 
average of all levels of probability ex- 
pectation influencing the given situa- 
tion. Thus probability adaptation is the 
pooling of present and past levels of ex- 
pectancy. Clinical prediction may there- 
fore be regarded as a statement of proba- 


2 In an address to the Graduate Psychology 
Club at Yeshiva University in 1961. 
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bility expectation about future behavior 
on the basis of previous expectations. 
For the clinician, the past or residual 
expectation is his “background of clini- 
cal experience” or general base-rate ex- 
pectancies about phenomena similar to 
the focal issue; present expectancy about 
a phenomenon is derived from viewing 
it against this past experience; future 
expectancy or prediction is then an 
interaction of these two probabilities 
according to the weighted-log mean 
formulation. The author has planned 
experiments to test this probability- 
adaptation model of clinical judgment. 


PERSONAL CHARACTERISTICS IN 
CLINICAL JUDGMENT 


Numerous studies (Masling, 1960) 
show that the clinician’s judgment of 
test protocols is influenced by his per- 
sonal and behavioral characteristics. 
Thus, another presumed absolute fea- 
ture of the test situation is found to be 
a relative one. What has been over- 
looked in these studies, though, is the 
effect of the stimuli upon the clinician 
and his subsequent judgment of a sub- 
ject’s protocol. What is suggested here 
is that the clinician’s judgments about 
the subject are contaminated, as it were, 
by his own adaptation to test stimuli. 
In AL terms, the clinician’s self-adapta- 
tion to projective stimuli acts as a con- 
textual or background influence in judg- 
ing the subject’s adaptation to these 
same stimuli. Helson’s (1948) general 
formulation for judgments with anchor 
stimuli would seem applicable here. This 
view is supported, too, by the study of 
Rethlingshafer and Hinckley (1954) 
showing the effects of judges’ attributes 
(age-weight) on judgment of the sub- 
ject’s age-weight according to the anchor 
effect described above. 

An AL model of clinical judgment 
may open the way toward greater under- 
standing of factors to be weighed in 
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making judgments, individual differences 
among clinicians in judging, and effects 
of training, personality, and idiosyn- 
crasies of the clinician in the interpreta- 
tion of test data. 


AL THEORY AND ASSUMPTIONS IN 
CLINICAL TESTING 


The clinical test battery is usually 
comprised of several techniques to pro- 
vide different kinds of information about 
the patient according to the nature of 
each component test. Two assumptions 
surround use of the clinical battery: (a) 
despite different underlying metalan- 
guages each test contributes to the same 
overall meaning and (b) each test is 
independent of the others in the battery. 

The first assumption about underlying 
similarities in the face of surface dis- 
similarities appears unwarranted be- 
cause: (a) different metalanguages of 
the tests probably present different 
meaning structures to which the patient 
must adapt and (b) combining tests for 
interpretation within some global theo- 
retical framework—say ego psychology 
—is gratuitous. These inherent differ- 
ences of meaning structure and adaptive 
tasks can be transcended, however, by 
viewing the response to the battery and 
its components within the framework 
of AL theory. This is in keeping with 
the growing shift from task performance 
to task approach. One may then trace 
the subject’s progress in adapting from 
one task to the next within the battery, 
each test presenting a unique adaptive 
task, but contributing to his overall 
adaptation. 

The assumption of independence of 
tests in the battery seems unwarranted 
because the subject’s responses may re- 
flect successive adaptation to different 
tests rather than an absolute response 
true at all times for a given test. The 
test battery may be viewed as a seri 
of stimuli (tests) in which AL prevails 


174 


as a function of the pooling or inter- 
action of the stimuli in the series. Hutt 
(1947) found variations in Binet IQ 
depending upon whether subtests were 
presented consecutively or adapted to 
the subject’s emotional needs of the 
moment. Heim (1955) showed that IQ 
varied depending upon whether the same 
type of item was imbedded in easy or 
hard tests, and that subjects either 
adapted immediately or in successive 
stages as function of difficulty level. 
Dollin and Sakoda (1962) have just 
shown sequential effects of cards of a 
certain emotional tone in the Thematic 
Apperception Test upon the tone attrib- 
uted to succeeding cards, as suggested 
by Helson’s stimulus-background formu- 
lations. 

Since order of stimuli appears to affect 
ALs within tests, the order in which 
tests are arranged in a battery may 
affect one another too; that is, each 
test interacts with the others as a series 
stimulus at the same time serving as a 
background experience for the next test 
confronted by the subject. Given this 
view, the clinician should be alerted 
to the necessity of charting the vicis- 
situdes of the subject’s shifts from one 


task to another as revealing overall 
adaptation, 


IMPLICATIONS or AL THEORY FOR 
THE CLINICAL TEST SITUATION 


There are three implications of the 
theoretical model proposed herein for 
the clinical test situation: 

Theoretical. The theory of AL offers 
a scientific explanation in place of a 
quasi-mystical description of the clinical 
test situation. It substitutes an opera- 
tional, quantifiable framework for make- 
shift explanations. Chiefly, it provides 
the necessary relativity to account for 

the adaptive behavior of the subject 
rather than an absolutist view of fixed 
trait behavior. The theory of AL opens 
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the way to a further understanding of 
the process of clinical judgment hereto- 
fore largely refractory to scientific study. 

It also offers the possibility of new 
insights into personality in adaptation 
terms, such as the meaning of integra- 
tion and the psychological dynamisms 
impeding or facilitating new adaptations. 
From the standpoint of general psy- 
chology, AL theory provides rapproche- 
ment between psychology of the labora- 
tory and psychology of the clinic. If 
it is a truism that increased mathemati- 
zation is essential to the development 
of a science, then the way may be paved 
by AL theory for clinical psychology to 
take its place within this historical 
process. 

Experimental. The theory of AL by 
virtue of its operational nature may be 
expected to stimulate and facilitate 
further experimentation on variables 
attending the clinical test situation. 
Thus the way is opened towards assess- 
ing the relative contributions of prior 
experience, background, and focal 
stimuli vis-à-vis subjects, 

The terra incognita of interactions 
among tests in a battery is now open to 
exploration. Factors influencing clinical 
judgments can be investigated system- 
atically with a view towards improved 
training. The problems of mensuration 
involved in clinical testing may be 
Subject to scrutiny with a degree of 
precision approaching psychophysical 
experiments. 

Practical. Advantages that may accrue 
to the clinician through the adaptation 
model are: (a) greater understanding of 
the adaptive capacities of the individual 
vis-à-vis different tasks and stresses; 
(b) ascertaining the degree of consist- 
ency of a subject—that is, what is 
generally typical of a subject as against 
what may be typical of a subject m0- 
mentarily; (c) an objective index C 
adjustment or adaptation and a dynamic 
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formulation of the adaptive process; 
(d) a common theoretical framework 
within which to view the patient’s per- 
formance on a variety of different tasks; 
(e) a further understanding of the proc- 
ess of clinical judgment towards im- 
proved training in use of instruments 
and in clinical role-taking behavior. 
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Adaptation-level (AL) theory represents a quantitative approach to the 


concept of adjustment and offers an 


experimental psychological point 


of departure for a science of personal and social behavior. It is suggested 
that a personality theory develop from classification of response types 
specifying relevant aspects of the focal, background, and residual factors. 


The various psychopathologies may 


be similarly viewed in terms of 


specific behavioral excesses or impairments within the framework of AL 
theory. A proposal for the use of AL theory in regard to personality 
and psychopathology is outlined and representative studies are cited. 


Somewhat more than a decade and a 
half has elapsed since Helson (1947) 
presented his adaptation-level (AL) 
formulation as a general approach to 
psychophysical data. The theory of AL 
might not have become more than an 
intramural psychophysical issue, if it 
did not expand into a model for frames 
of reference, a schema for a psychology 
of adjustment, and the embryo for a 
general behavior theory.? Exponents of 
this point of view studied the applica- 
bility of its generalizations about adjust- 
ment and pooling within Psychophysical 
judgment areas (Michels & Helson, 
1954), and then ventured, with success, 
into the fields of motivation and learn- 
ing (Bevan & Adamson, 1960), personal 
and social variables (Blake, Mouton, & 
Hain, 1956; Blake, Rosenbaum, & Dur- 


1 Paper presented in “Symposium: Contin- 
porary Problems in Adaptation-Level Theory,” 
American Psychological Association, St, Louis, 
1962. Work supported by United States Public 
Health Service Grant M-1121. 

2 The interchangeable use of the words 
schema and theory reflects the authors’ con- 
flict concerning purity regarding these concepts, 
Without presenting any tedious discourse Ta- 
tionalizing either word within a philosophy of 

science, schema and theory will be used to 
satisfy all, while emphasizing the position that 
an adaptation-level formulation provides use- 
ful order and system at the same time that it 


measures. 
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yea, 1955; Rosenbaum & Blake, 1955), 
clinical judgment (Campbell, Hunt, & 
Lewis, 1957), and intelligence (Heim, 
1955). 

This paper is specifically concerned 
with a second look at AL schema that 
emerged from a psychophysical tradition 
which is offering promise as a general 
approach to a science of the person and 
his psychopathologies at a time when 


personality theories are seyere disap- 
pointments. 


PERSONALITY THEORY 


Our psychophysical ancestors and 
their psychometric cousins did not fore- 
see modest excursions into sensory and 
motor functions leading to a testing 
movement that would produce an arro- 
gant step-grandson called clinical psy- 
chology demanding a recognized place 
in a respectable science while violating 
all of the ground rules. This step-grand- 
son flourished and is presumably doing 
mankind a service but is only loosely 
tied to general psychology by a psycho- 
metric tradition, an increasing accumula- 
tion of unorganized and contradictory 
research results, fragile bridges betwee” 
examining room observations and 
psychologies of learning, sensation 2” 
perception, and fiat. : 

The clinical psychologist with = 


¢ 
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functional concern with disorder rapidly 
outgrew the psychophysical tradition 
when the returns from an era of individ- 
ual differences and mental testing dimin- 
ished. Even the more sophisticated taxo- 
nomic factor analysts who attempted to 
provide clinical psychology with a more 
up-to-date psychometry failed to pro- 
vide a shoe that fit the burgeoning clini- 
cal foot. Psychodiagnosis and psycho- 
therapy became more complex than a 
reaction time or keenness of discrimina- 
tion study, and even the higher mental 
functions of Binet were not high enough 
to deal with the problems of mankind 
that fell into the receptive laps of psy- 
chologists, 

Clinical psychology with its mental 
tests emerged from a psychophysical 
heritage and then renounced its ancestral 
ties. With ineffective conceptual and 
methodologic models available in tra- 
ditional laboratory psychology, the clini- 
cian was forced to take matters into his 
own hands, and out of his rank a new 
breed was born—the personality theo- 
rist. This specialist was created to de- 
velop a science of the person that would 
receive acceptance as theory and act as 
a clinically useful point of departure. 

However, a science of the person 
should not reflect the desperation and 
impatience of the clinician, and the arts 
and sciences of behavior must meet 
within common frames of reference 1n 
accordance with the developmental level 
of each. ; 

No aspect of a scientific psychology 
can defend itself as some off-brand 
humanism, Or on-brand philosophy of 
existence. 

Psychologists can learn much from 
humanists concerning missions and tech- 
nologic goals. The humanist can remind 
the psychologist that one of his responsi- 
bilities is to regard human subjects as 
people. 


The philosopher of existence and his 
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band of psychoanalytic followers can 
provide an occasional astute observation 
that might be incorporated into a scien- 
tific philosophy. In the maze of fanciful 
and literary excursions into the realm of 
mythology, or pronouncements about the 
destiny of man, whether optimistic or 
pessimistic, whether toward self-destruc- 
tion or self-realization, one might find 
profound and testable observations, or 
more likely get lost and ensnared. 

When one accepts the concept per- 
sonality theory, one accepts a frontal 
assault upon the person that measures 
directly the integration of all of the attri- 
butes of cognition, conation, sensation 
and perception, motivation and learning, 
traits and dispositions that offer at once 
the common and unique qualities of a 
given intensity regarding an organism 
within a species. This would require a 
separate science of the person with its 
own methods and language. 

Psychological attributes combine as 
the dimensions of behavior that interact 
or pool into the entity we casually call 
personality. No theory can ignore the 
obligation to identify and label these di- 
mensions, measure their attributes, and 
explicate the process of pooling that 
leaves us with the person. The refusal 
to highlight a morphology of the person 
within a scientific behavior theory is in- 
excusable. The clinician’s abhorrence of 
dissection and the sight of the psycho- 
logical blood reflects his commendable 
humanism, but the systematic labeling 
of the gross and microscopic anatomy of 
the psychological cadaver may lead to 
an ordered science of the person and his 
behavior pathology. 

What are now called theories of per- 
sonality are not theories at all. They 
either represent philosophies of being, 
methods designed to study unsystema- 
tized behavioral attributes (Cattell, 
1946; Eysenck, 1947; Sheldon, 1940, 
1942, 1954), or some translation of a 
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philosophy of existence into a behavior 
theory with much lost in the translation 
(Dollard & Miller, 1950; Mowrer, 
1950). 

Psychology should acknowledge the 
utility of these discourses on the possi- 
ble nature of people in the clinical arts. 
Orderly philosophies of man by various 
psychoanalysts or other-named psycho- 
therapists are useful for the clinician who 
must describe observations in a com- 
municable, literary way. These tempt- 
ing philosophies are functional for the 
clinical psychologist as a language for 
descriptive psychodiagnostics and verbal 
psychotherapies. These fluent discourses 
are not theories and a science of man 


will not emerge from clinically useful 
literary expositions, 

A revisit to the psychophysical neigh- 
borhood of his parents might be of more 
than sentimental value to the clinical 
psychologist. 


Psycuopuysics, PERSONALITY THEORY, 
AND PSYCHOPATHOLOGY 


The psychophysical methods which 
may be identified both with the emer- 
gence of scientific Psychology and clini- 
cal psychology have at times been treated 
as ends in themselves, and the term psy- 
chophysics has been synonymous with 
classical procedures used to settle thresh- 
old and difference limen problems of 
interest to very few people. 

A “simple” absolute threshold deter- 
mination is variable, relative to a variety 
of contextual and organismic factors, 

Judgment is a consequence of a pooling 
process which integrates the patterns of 
focal and background inputs, as well as 
concepts or past experience Tesiduals 
based upon meetings with similar stim- 
uli. Present psychophysical laws and 
their mathematical accompaniments pro- 
vide rough approximation predictions 
within a restricted domain of laboratory 
circumstances. The tradition that 


started with Weber and Fechner had a 
more noble goal that sought to relate the 
responses of organisms to the energetic 
configurations imposed upon their re- 
ceptors. Any general approach to psy- 
chophysics represents a study of the 
judgment process under the plethora of 
available focal and background input 
conditions, and the endless varieties of 
organismic states, 

This less than modest goal of creating 
a general theory, or a scientifically use- 
ful conceptual schema for the clinician 
and psychopathologist out of a psycho- 
physics that appeared a century ago to 
study the sensitivity of the organism, 
may appear to represent a regressive re- 
version to the unsuccessful attempts by 
Kraepelin (1896) to use the psycho- 
physical laboratory as the prototype for 
a scientific psychiatry. The difference 
rests with the functional distinctions be- 
tween classical and frame of reference 
psychophysics. The latter permits a be- 
havior taxonomy based upon responses 
to focal and background inputs, as well 


as classification of the attributes of 
concepts. 


AL as A GENERAL CONCEPTUAL SCHEMA 


The clinical psychologist created the 
notions of personality and personality 
theory in the quest of a functional psy- 
chology of the integrated dimensions of 
behavior that produced an ordered or 
disordered person. Although their con- 
cern was with adjustment and adapta- 
tion, their holistic, humanistic philoso- 
phies distracted them with an obsessive 
concern with motivation and purposes of 
mankind. 

Every psychophysical experiment is a 
study of adjustment. Any response to 
a configuration of focal and background 
inputs with a given organismic state rep- 
resents an adaptation. Although obvi- 
ous, devotion to method blinded the 
psychophysicist to his calling as a gen- 
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eral behavior theorist of adaptation. It 
remained for Helson (1947, 1948, 1955, 
1959) to recognize the relationship be- 
tween specific and general instances of 
adaptation; a conceptual schema as a 
point of departure for a quantitative 
theory of behavior was established. 
Others recognized the importance of 
focal and background conditions in scale 
formation and the designation of indif- 
ference points. The homeostatic em- 
phasis in physiologic (Cannon, 1939) 
and behavioral contexts is not new. 
However, with the rallying concept ad- 
justment Helson brought these consid- 
erations together into a single systematic 
schema that permits manipulation of 
contextual and residual variables, meas- 
urement, and first approximation mathe- 
matical formulations. 

Although AL theory began as a series 
of generalizations regarding spectral 
energy input and background reflect- 
ances on the one hand, and perceived 
color on the other (Helson, 1938; Hel- 
son & Jeffers, 1940), its author (Helson, 
1947, 1948) took the risk of criticism for 
extending a laboratory and physiologi- 
cally based concept adaptation through 
analogy. Is the sensory psychological 
and sensory physiological time-honored 
identification of adaptation related to 
adaptation within a series of lifted 
weights, or are we dealing with a seman- 
tic artifact which is a function of the 
labels within a category scaling proce- 
dure (Stevens, 1958)? Does personal 
adaptation so basic for the clinician bear 
any resemblance to the spirit and letter 
of these terms as understood in the labo- 
ratory? Is this extension of adaptation 
from its physiological to absolute judg- 
ment to personal and social contexts 
little more than a word game? 

It is well known that AL schema 
(Helson, 1947, 1948) defines adjustment 
in terms of responses to situations scaled 
along bipolar continua indicating neu- 


tral, or balanced states. This formula- 
tion viewed the neutral AL as a 
weighted geometric mean of three classes 
of stimuli affecting an organism within 
a temporal context: the inputs pri- 
marily within the contemporary, focal 
scene; all other inputs imposed upon 
available receptors forming the back- 
ground for the focal circumstance; all 
organismic factors including residuals 
or concepts based upon historic en- 
counters with relevant inputs, as well 
as constitutional and dispositional fac- 
tors. All three are hopefully definable 
in stimulus terms and pool through an 
integration process to produce AL. 

The specific details and assumptions 
of AL theory regarding the judgment 
process as well as its dangerous but suc- 
cessful missions beyond the psychophysi- 
cal bench are reviewed elsewhere (All- 
port, 1955; Guilford, 1954; Helson, 
1947, 1948, 1955, 1959). While Hel- 
son’s psychophysical upbringing leaves 
him with less interest in the third cate- 
gory of pooled stimulation, residuals or 
concepts, he acknowledges (Helson, 
1959) that herein rest the critical areas 
for personality theory. The student of 
healthy and pathologic personal be- 
havior is concerned with the develop- 
ment of residuals, the mechanics of their 
pooling with contemporary stimulation, 
and the process of conceptual modifica- 
tion by the manipulation of focal and 
background conditions to facilitate effec- 
tive adjustment to changing internal and 
external contexts (Helson, 1955, 1959). 


AL AND PERSONALITY 


Sympathy with global theories of per- 
sonality allows the bypassing of funda- 
mental psychological attributes and di- 
mensions that integrate or pool leaving 
a “whole, undissectable person.” This 
may appear as a short cut but it is un- 
likely, after a half-century of flounder- 
ing, that a general behavior theory of the 
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person will emerge from the examining 
room or mental health agencies. It is 
probable that the lessons of history will 
point to the laboratory for a science of 
people. 

If this is the case how can the clini- 
cian avoid losing the person? How can 
AL schema provide methods and sys- 

tems that will not violate the person, and 
his healthy or psychopathologic adapta- 
tion? 

There are two global methods to di- 
rectly approach personality and per- 
sonality theory: those involving the 
mechanics of proof emphasizing statisti- 
cal design and rigor, or others operating 
through clinically useful but scientifi- 
caily deplorable literary expositions (Ad- 
ler, 1954). Neither approach provides 
scientifically useful theories, models, or 
schemata. 

There are two approaches to the appli- 
cation of AL theory to the science of the 
person. One involves a direct attack 
upon attitudes and everyday behavior 
dynamics using the AL model in a 
study of what is generally called per- 
sonality and social variables. Investiga- 
tions of volunteering and generosity 
(Blake, Rosenbaum, & Duryea, 1955; 
Rosenbaum & Blake, 1955) using labo- 
ratory derived principles of judgment 
represent a direct application of AL 
schema to personal and social situations. 

A second keeps the investigator and 
theorist in the laboratory and although 
he, too, manipulates focal and back- 
ground conditions as well as the more 
elusive residuals, he stays with well- 

graded input continua emphasizing tra- 
ditional magnitude and sensory dimen- 
sions. , 

This is simply a difference in research 
strategy. The direct application of AL 
principles to personal and social 7 
bles has been the more usual approac 
when this schema has been extended to 


- -tade the behavior of people in every- 
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day life situations. However, one should 
not overlook the possibility that a theory 
of personality and psychopathology 
might just as well emerge from a classi- 
fication of response types based upon 
traditional laboratory procedures. 


BIoMETRICS AND AL 


Adjustment within a given series of 
weights associated with a controlled 
background context along with known 
organismic characteristics is as much an 
experiment in, and measurement of per- 
sonality, as some crystal ball that re- 
veals hidden hostilities to poor unsus- 
pecting siblings, or ghosts from the past 
awaiting exorcism upon the couch. Cate- 
gory judgments of duration within a 
well-graded temporal series under spe- 
cific sensory and anchoring conditions 
can be considered as much a measure of 
personality as the judgment of the sub- 
ject who may or may not volunteer in 
the face of varied social pressures. All 
of these are measures of personal be- 
havior, but a taxonomy that may permit 
a personality theory is more likely to 
emerge from the laboratory, 

Surely all psychophysical responses 
are partly if not primarily cognitive. In- 
vestigators need only seek modal re- 
Sponses to specific focal and background 
conditions, within the framework of spe- 
cific organismic or residual states, and 
define these modal responses as the di- 
mensions of behavior to be systematized 
into a science of the person. With our 
present vague definitions of personality, 
no one will deny that a laboratory psy- 
chophysical adjustment is a measure of 
personality, but the clinician will ask 
about the relationship between adapta- 
tion to a series of sounds of varied in- 
tensity and adjustment to everyday life 
experiences, 

Simple laboratory adjustments tO 
series of magnitudes are indeed adjust- 
ments to everyday life situations. The 
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question is not one of general relevance, 
but of specific relevance. The scaling of 
intensity, extensity, or protensity is part 
of general adjustment. We are all ex- 
posed to everyday life analogues of the 
various psychophysical methods and 
scaling procedures within the f ramework 
of a multitude of dimensions. These 
judgmental circumstances may, with 
their contextual variations, encompass 
the gamut of problems faced by clini- 
cians in the examining room. 

Zubin’s (Burdock, Sutton, & Zubin, 
1958; Zubin, 1958) close association 
with problems of personality and psy- 
chopathology has increased the firmness 
of his position that regards the psychol- 
ogy of ordered and disordered people as 
a laboratory science. Proposing a bio- 
metric approach to the study of person- 
ality and psychopathology, he offers five 
relevant, measurable activity levels that 
may be investigated quantitatively dur- 
ing a base line idling state and under 
stress-loading conditions. Zubin (1958) 
courageously turned back the clock and 
re-emphasized the objective techniques 
offered through psychophysics and psy- 
chometrics. His biometric approach 
hopes to integrate the data from the 
hierarchy of the five basic functioning 
levels: physiologic, sensory, perceptual, 
psychomotor, and conceptual under base 
line and stressful conditions. This view 
requires a laboratory approach to per- 
sonality and psychopathology focusing 
upon the integration of conditions of 
stimulation and levels of organismic 
functioning, pleading for rallying con- 
cepts and models that exploit consisten- 
cies and discover major sources of 
variance. 

A biometric model requires a tax- 
onomy of behavior based upon the psy- 
chophysical and psychophysiological 
sciences and their relationships. Zubin’s 
search for schematization reflects a con- 
cern about the undirected multiplication 


and accumulation of tarw -~ 

pathology; his (Burdock, Sutton, « — 
bin, 1958) dismay with current concep- 
tions of personality and personality 
theory Jeads to the conclusion that one 
cannot take for granted any functional 
interdependence regarding these theories 
and mental disease. This is not the case 
if one begins with a biometric AL ap- 
proach to personality and psychopa- 
thology. 

Biometrics provides a useful empirical 
track and offers wise suggestions regard- 
ing the need for schematization. How- 
ever, the approach of biometricians is 
first and foremost a devotion to meas- 
urement and the classification of broad 
areas to be measured. The homeostatic 
issue of adjustment within AL theory 
may provide the schema for ordering the 
multitude of biometric facts that are 
accumulating. 


An AL TAXONOMY OF PERSONALITY 
AND PSYCHOPATHOLOGY 


A psychological classification system 
based upon a quantifiable concept of ad- 
justment derived from AL theory may 
permit a scientific psychology of the 
person and his psychopathology. The 
characteristics of input signals that con- 
vey focal and background information 
determining adjustment levels for people 
or groups can be specified with precision 
and accuracy. It is feasible to scale 
individuals according to response types 
based upon characteristic modes of be- 
having in regard to controlled foreground 
and background conditions. Individuals 
differ with reference to the extent to 
which they weight or pool various as: 
pects of an external context; subject 
share certain common response patter’ 
within a specific external context; ` 
dividual differences exist regarding 
relevance of details within the foce 
background contexts; people do 
carry equivalent residuals from 
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experience into experimental and clini- 
cal contexts; and, just as personality 
may be defined in terms of AL re- 
sponse types, so can specific excesses 
be classified as psychopathology. 

Personality and psychopathology can 
be defined in terms of ALs in relation to 
specific focal, and background contexts 
on the one hand, and organismic resid- 
uals on the other. 

It will be necessary to expose the 
seemingly endless relevant contextual 
factors in a psychophysical judgment, 
quantitatively assess their contribution 
to the AL, and apply derived princi- 
ples across dimensions to evaluate their 
generality. We can point to central 
tendency effects, time-order-errors, se- 
quential effects, end effects, contrast 
and assimilation effects, and ALs lo- 
cated in the neighborhood of the 
weighted geometric mean of a series. 
These offer first approximation generali- 
zations that are dependent upon an 
abundance of known and unknown con- 
textual and residual factors, With AL 
so dependent upon the capricious and 
very complex context, is it any wonder 
that contradictory findings appear re- 
garding specific aspects of judgment? 

The broad schematization has proven 
useful; a pooling theory of adjustment 
should be fruitful for a general psy- 
chology of the person. Now it is neces- 
sary to get down to the business of the 
exploration of details. The microscopic 
components of a broad schema represent 
its experimental essence. A broad 
schema that leads to a point where one 
must abandon hope for quick theory, 
but does lead to a point of increasing 
experimental returns requiring effortful 
exploration of detail is likely to leave one 

on a productive road. The AL is ready 

for the step beyond broad, schematic 
eralizations. 

os There is the tedious task of identi- 

fying the many relevant attributes 
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within a given judgmental dimension 
that contribute to an adaptation level. 
This will be a long-suffering, unglamor- 
ous enterprise. It is known that se- 
quential effects (Parducci, 1962) and 
channel of input (Behar & Bevan, 1960, 
1961; Goldstone, Boardman, & Lhamon, 
1959; Goldstone & Goldfarb, in press; 
Lhamon, Edelberg, & Goldstone, 1962) 
are relevant in specific judgmental cir- 
cumstances. Absolute threshold is not 
a limiting factor in scale shifting regard- 
ing intensity and size judgments (Black 
& Bevan, 1960; Boardman & Goldstone, 
1962; Goldstone, Goldfarb, Strong, & 
Russell, 1962), stimulus range, and pres- 
entation interval influence judged loud- 
ness (Bevan, Reed, & Pritchard, 1961). 
and information regarding relevant as- 
pects of effective anchors in form judg- 
ment is now available (Bevan & Pritch- 
ard, 1963). A long list of stimulus 
and presentation parameters is waiting 
to be investigated in seemingly endless 
combinations to determine what in- 
fluences judgment. 

2. After relevance within a judgment 
dimension is established and classified, 
the measurement of the extent of in- 
fluence is a next step. Concern with the 
amount of impact of a stimulus pa- 
rameter under a controlled presentation 
circumstance, the effect of a presentation 
circumstance under controlled condi- 
tions of stimulation and the interaction 
of stimulus and presentation variables 
are of basic interest. 

3. It is important to know whether 4 
relevance which produces predictable 
alterations in the AL is specific to a 
given judgmental dimension, or if one 
possesses a more general principle of 
behavior. It is necessary to derive meas- 
urable similarities and differences in the 
effect of a contextual factor across judg- 
ment dimensions. 

4. The elusive residual may be viewed 
as the wastebasket for AL theory °" 
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it may be an inescapable and even de- 
sirable region for study. Traditionally 
the psychophysicist focuses upon the 
contemporary scene and the residuals 
or concepts learned from previous en- 
counters with relevant stimuli are ig- 
nored, trained-out, or averaged into ob- 
livion through the use of groups. Insofar 
as these concepts reflect response dis- 
positions they are of essential impor- 
tance to the student of the person. 
Insofar as residuals represent the im- 
mediate displays of disorder they are the 
primary concern of the psychopatholo- 
gist. A science of the person and his 
psychopathology is concerned less with 
the organism as a “null instrument” and 
more with perceptual-conceptual (Bru- 
ner, 1957; Goldstone, 1962) behavior 
grossly classified in terms of attitudes, 
thinking, and experience. The absolute 
method emphasized by AL theory rep- 
resents a more relevant laboratory ana- 
logue of the personal encounters involved 
in everyday life adjustments. Whereas 
a comparison method provides an ex- 
ternal standard to an organic meter, 
the absolute method requires the person 
to pool input configurations with internal 
norms or concepts. Controlled focal and 
background configurations reveal data 
concerning the status of these internal 
standards. The behavioral dysfunctions 
of the various psychopathologies (Gold- 
stone, 1962; Zubin, 1958) are rarely 
of a metering or sensitivity nature. Dis- 
ordered absolute judgments indicate al- 
terations, impairments, OF deficiencies 
in the scaling process OF the residual 
concept. The student of the person and 
his mental disorders is particularly in- 
terested in the use of past experience 
and the ability of concepts to pool with 
relevant contemporary inputs and ward 
off inappropriate background noise. 
Residuals are learned and are avail- 
able within certain acceptance-rejection 
limits. Inputs are compared with these 
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concepts and a neutral point is estab- 
lished. 

The delusion and hallucination of the psy- 
chotic, the confusion of the toxic state, the 
exaggerations or denials of the neurotic, and 
the ineffectiveness accompanying personality 
disorder and mental defect all reflect absolute 
judgments that do violence to either the nature 
of an accepted norm or the accepted range of 
that norm . . . [Goldstone, 1962, p- 264]. 


The theory of AL permits the defini- 
tion of a person’s disordered judgment 
in terms of his specific response to the 
detailed focal or background stimulus 
configurations. Conceptual impairment 
may be a function of insufficient learn- 
ing, incorrect learning, conflictual learn- 
ing, or, a contemporary disruption in 
conceptualization and scaling due to such 
other organismic factors as fatigue, in- 
toxication, or mental illness. 

These residuals need not be considered 
mysterious aspects of the conscious 
or unconscious mind unapproachable 
through laboratory techniques. Concepts 
may be assigned attributes much like 
the sensations. These attributes such as 
vividness or intensity, availability, range 
of identity and range of response cate- 
gorization (Goldstone, 1962) may be in- 
vestigated by developmental studies of 
learned social norms, as well as by 
studies of immediate and successive 
pooling in order to explore the persist- 
ence and strength of these residuals in 
the face of relevant and irrelevant focal 
and background stimulation. 

The taxonomy of these residuals and 
their pathology remain as a challenge to 
the AL theorist. 

5. As each relevant internal and ex- 
ternal influence is discovered, labeled, 
and extensively explored within and be- 
yond a specific judgment dimension, the 
theorists may begin moving from meth- 
odologic successes tO theory. Increas- 
ingly sophisticated measurement and the 
discovery of additional relevant factors 
and their mode of operation may direct 
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the pooling schema of AL to a general 
behavior theory of adjustment. The 
definition and measurement of more at- 
tributes of focal, background, and re- 
sidual conditions will complicate the 
mathematics of a psychophysical law but 
the principle of adjustment will remain 
unchanged. The process of pooling, pat- 
terning, or organization will become the 
scientific focus of attention. It is now 
essential to determine what is being 
pooled and the conditions under which a 
given quality and quantity of pooling 
transpires. 

The need for a functional biometric, 
AL model for personality and psycho- 
pathology cannot be overemphasized. 
The clinical-experimental psychologist 
is faced with the problem of ordering 
the abundance of research reports that 
accumulate daily offering generalizations 
about people and mental disease. It be- 
comes more perplexing when people 
with schizophrenia are compared with 
healthy controls and found to have re- 
duced size constancy, overconstancy, or 
equivalent constancy (Rausch, 1952; 
Reynolds, 1953; Sanders & Pacht, 1952; 
Weckowicz, 1957). The reduced con- 
stancy accompanying schizophrenia ob- 
tained by Boardman, Goldstone, Reiner, 
and Himmel (1964) was interpreted in 
terms of specific conceptual alterations 
accompanying the illness, in relation to 
specific contextual factors, 

More confusion is available when one 
notes that with identical psychophysical 
methods, schizophrenic patients show in- 
creased overestimation of 1-second dura- 
tions (Lhamon & Goldstone, 1956; 

Weinstein, Goldstone, & Boardman, 
1958; Wright, Goldstone, & Boardman, 
1962) while underestimating the length 
of 1 inch (Boardman, Goldstone, Reiner, 
& Fathauer, 1962). Healthy subjects 
were more accurate and precise, and, less 
influenced by anchors, when making 
absolute judgments of an inch than of 
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a second (Boardman, Aldrich, Reiner, 
& Goldstone, 1959; Goldstone, Lhamon, 
& Boardman, 1957). Salzinger (1957) 
found increased anchor effects by schizo- 
phrenic subjects in a weight-lifting ex- 
periment which was confirmed in regard 
to spatial judgments (Boardman, Gold- 
stone, Reiner, & Fathauer, 1962). This 
was not found with similarly obtained 
temporal judgments (Webster, 1962). 
General statements about magnitude 
judgment in general and relations with 
personal or psychopathology variables 
must consider the specific dimension or 
sensory attribute and likely relevant 
contextual influences, 

Even general statements about magni- 
tude judgments by healthy and psycho- 
pathologic subjects are not safe within 
a given dimension. It was discovered 
that the generalization that schizo- 
phrenic patients Overestimated auditory 
durations obtained from three replica- 
tions (Lhamon & Goldstone, 1956; 
Weinstein, Goldstone, & Boardman, 
1958; Wright, Goldstone, & Boardman, 
1962) was an oversimplification. The 
alteration was not a general property of 
time judgment at all, but a function of 
Specific aspects of the temporal cir- 
cumstance. The original studies used 2 
modified method of limits procedure 
with an arithmetic series. When the 
stimulus arrangements were altered to 
either a geometric step interval (Wright, 
Goldstone, & Boardman, 1962) or the 
constant method (Webster, Goldstone, 
& Webb, 1962) the schizophrenia- 
healthy differences disappeared, Close 
attention to these important contextual 
details and their differential effect upor 
various populations will provide the 
basic information about healthy a” 
psychopathologie response types. Cori 
textual influences upon patient-contre 
differences are not sources of expet!” 
mental error, but represent the essence 
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of these differences and provide a basis 
for a personal taxonomy. 

Other studies have shown reduced suc- 
cessive pooling by schizophrenic patients 
in time-judgment experiments using al- 
ternate anchors (Webster, 1962) ; Wein- 
stein, Goldstone, & Boardman, 1958) 
and channel of input (J. L. Goldfarb, 
unpublished), and, sense mode has also 
been suggested as a relevant dimension 
in predicting patient-control differences 
(Wright, Goldstone, & Boardman, 
1959). Impaired successive pooling and 
reduced intersensory discrimination ac- 
companying schizophrenia permitted a 
psychological explanation of hallucina- 
tions based upon the AL schema (Gold- 
stone, 1962). 

Pointless arguments about contradic- 
tory results (Aronson, Silverstein, & 
Klee, 1959; Boardman, Goldstone, & 
Lhamon, 1957) do little more than em- 
phasize the oversimplification involved 
in general statements about personal or 
group characteristics when the relevant 
momentary input and internal details 
are not considered. There are no con- 
tradictory or conflicting results, only in- 
sufficient attention to, and specification 
of, the relevant focal, background, and 
residual factors. 

Regarding the context alone, AL fol- 
lowers have a long, hard road cut out 
for them. Adding a scientific concern for 
the less manageable residuals compli- 
cates but completes a far-reaching goal 
of a science of the person and his psy- 


chopathology. 


SUMMARY 


The direct, global assaults upon a pos- 
sible science of the person have provided 
us with useful clinical philosophies but 
have not left us with conceptual ap- 
proaches that might resemble embryonic 
theory. 

As a quantitative approach to adjust- 
ment, the broad AL schema offers a pos- 


sible general psychological point of de- 
parture that might lead to a theory of 
personality. 

It is proposed that a laboratory based 
AL theory of personality develop from 
a classification of response types speci- 
fying the relevant aspects of the focal, 
background, and residual factors. i 

The various psychopathologies may be 
similarly viewed in terms of the specific 
excesses or impairments within the 
framework of Helson’s AL and Zubin’s 
biometric models. 

Every factor relevant to the location 
of a neutral point and every influence 
that shifts a reference scale is an at- 
tribute of personality. It is part of the 
person and he may be compared with 
others, or at times, with himself, with 
regard to these factors and influences. If 
there is to be a science of the person or 
a personality theory, every adjustment 
determined by the abundance of mo- 
mentary properties of focal and back- 
ground stimulation, as well as the 
momentary residuals or state of the or- 
ganism must be considered a personality 
measure. Whether or not a specific lab- 
oratory measure of adjustment is rele- 
vant to the personal adjustments of 
interest to the clinical psychologist is 
for the scientific psychopathologist to 
decide. This newer breed of personality 
theorists, emerging from both the labora- 
tory and the examining room can desig- 
nate the dimensions of adjustment that 
are of interest to the clinician. These 
can be specified in the language of AL 
and the focal and background inputs can 
be manipulated in the laboratory under 
idling and stressful conditions developing 
a taxonomy of personality which will 
permit comparison of psychopathology 
with health. 
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Retention curves based upon recognition scores may be comparable in 
slope and amount of retention to curves based upon recall and anticipa- 


tion performance. 


Previous contrary conclusions are overgeneralized. 


They are the result of the selection of easy recognition tests and failure 
to control the variable of overlearning. Measures of recognition, free 
recall and anticipation are dichotomous, and the slopes of curves based 
upon such measures are artifacts of the changing sensitivity of the meas- 
ure. The curves can therefore not provide the basis for general conclusions 
regarding relative amounts of forgetting over different time periods, 


Ambiguity regarding the interrelations 
among commonly used response indi- 
cants creates serious methodological as 
well as theoretical problems. If these 
interrelations are not well understood, 
overgeneralizations and misinterpreta- 
tions of experimental findings are un- 
avoidable (Bahrick, Fitts, & Briggs, 
1957). The classic study establishing 
the interrelations among measures of 
retention was performed by Luh (1922), 
Postman and Rau (1957) recognized 
certain inadequacies of Luh’s method. 
They collected extensive new data and 
thoroughly reviewed the literature on 
this topic. One of the principal differ- 
ences between the findings of Luh and of 
Postman and Rau pertains to the curves 
based upon recognition measures. Post- 
man and Rau found practically no de- 
cline of recognition scores over a 2-day 
period when subjects originally learned 
the material to a criterion of one perfect 
anticipation trial. They report: “For all 
practical purposes the method of recogni- 
tion shows no forgetting at all during the 
time interval used in the present study 
[Postman & Rau, 1957, p. 291].” They 
suggest that the decline of recognition 
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performance over the same period re- 
ported by Luh was the result of method- 
ological errors of interpolating tests of 
reproduction, as well as other uncon- 
trolled interference effects which tended 
to lower subsequent performance on 
recognition tests. In regard to the inter- 
relation between indicants of recall and 
recognition Postman and Rau (1957) 
concluded: “The one fact for which there 
is substantial experimental evidence is 
that tests of recognition yield higher 
scores than do tests of recall [p. 218].” 
This conclusion has extensive documen- 
tation (e.g., Achilles, 1920; Clarke, 
1934; Hollingworth, 1913; Miler, 1957; 
Myers, 1914; Stalnaker, 1935). No 
comparable support exists for Postman 
and Rau’s finding that recognition shows 
no retention loss at all over a 2-day 
period, but it is generally held that 
recognition curves show a more gentle 
slope and do not exhibit the negative 
acceleration observed in the classical re- 
call and anticipation retention curves 
(Luh, 1922; Miler, 1957). 

It is the principal purpose of this 
article to show that both of the above 
conclusions regarding the superiority of 
recognition to recall performance, ani 
regarding the slope of retention curves 
are overgeneralizations, and therefore 
misleading, because the findings 0” 
which they are based do not represent 
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intrinsic differences between indicants of 
recognition and recall. Rather, the find- 
ings reflect the exclusive use of experi- 
mental designs which fail to control the 
variable of overlearning, the exclusive 
use of “easy” recognition tests in which 
correct and incorrect alternatives are 
highly differentiated, and artifacts aris- 
ing from the use of dichotomous scoring 


procedures. 


EXPERIMENTAL DESIGNS 
AND OVERLEARNING 


In most studies of retention, stimulus 
material, rather than retention measures 
per se, have been the subject of investi- 
gation. Two general types of experi- 
mental design are conventionally used 
when retention of different stimulus ma- 
terial is investigated: (a) an equal 
amount of training is given to subjects 
in the various conditions to be compared 
and (b) subjects are trained to a com- 
parable criterion under the various con- 
ditions. To illustrate the advantage of 
using both types of design let us suppose 
it is desired to compare retention for 
nonsense syllables high and low in as- 
sociation value. Following Design @ 
an equal number of training trials is 
administered to the groups learning the 
two kinds of material, and retention is 
tested after suitable intervals. Fol- 
lowing Design b the two groups are 
trained until they reach a comparable 
response criterion, such as the first trial 


on which all responses are correct. If 
only Design 4 is used, and it is found 
that retention is better for the syllables 
high in association value, it is not pos- 
sible to determine if this is true just 
because the material is “easier,” and was 
therefore learned to a higher degree. 
By supplementing this comparison with 
data from Design b one can determine 
whether the high association value af- 
fected retention, independent of the de- 
gree of original learning. 
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Investigations in which comparisons 
of retention measures rather than of 
stimulus material are the subject of in- 
terest have made use of two variations 
of Design a, but have failed to sup- 
plement this with information from De- 
sign b. In the traditional version of 
Design a investigators have given all 
subjects a constant number of training 
trials and have later compared per- 
formance on recognition and recall tests. 
In the second version all subjects are 
trained to the same criterion, usually the 
first trial of correct anticipation, fol- 
lowed by the comparison between recall 
and recognition performance on later re- 
tention tests. At first blush it would 
appear that this second version corre- 
sponds exactly to the design of Type 
b, and should therefore yield the 
same kind of information, but this is 
not so. 

To compare retention measures by 
Design b, subjects of various groups 
must be trained until they have reached 
a comparable criterion on the same task 
on which retention is tested later. This, 
of course, entails administering different 
numbers of training trials for the vari- 
ous groups, similar to the case described 
earlier in which different numbers of 
training trials were administered to the 
groups learning nonsense syllables high 
or low in association value. In the second 
version of Design a described above, 
the subjects in the recall and recognition 
groups receive equal amounts of train- 
ing, and are not brought to a comparable 
criterion on the recognition and recall 
tasks on which retention is later meas- 
ured. Thus, the degree of original learn- 
ing with respect to the condition being 
tested during retention has not been 
controlled. As a consequence this type 
of study does not yield the information 
usually available from designs of Type 
b, and as a matter of fact, does not 
add much to the information already 


190 


available from the traditional version 
of Design a. 
To compare retention for recall and 
recognition by Design b, it is neces- 
sary to train one group of subjects until 
all of their recall responses are correct, 
and another group of subjects until all of 
their recognition responses are correct. 
The general conclusion that tests of 
recognition yield higher scores than do 
tests of recall has not been based upon 
any such comparison and is, therefore, 
limited to situations in which the degree 
of original training is not comparable. 
Because investigators have used only 
those recognition tests in which correct 
and incorrect alternatives were highly 
differentiated, the degree of learning with 
respect to the recognition task was gen- 
erally much higher than the degree of 
learning with respect to the recall task 
when training was interrupted. The 
better performance on later retention 
tests, as well as the difference in slope 
of the retention curves based upon re- 
call and recognition measures, reflects 
this difference in the degree of original 
learning. Thus, Postman, Jenkins, and 
Postman (1948) reported that after six 
trials of exposure to 48 nonsense syl- 
lables subjects were immediately able 
to make an average of 24.33 correct 
recognition responses, but only 7.20 cor- 
rect recall responses if the recall test was 
administered first. They reported 27.66 
and 9.09 correct responses, respectively, 
if the recognition test was taken first. 
Similarly, Achilles (1920) reported that 
subjects who were given 50 seconds to 
study a list of 25 nonsense syllables were 
immediately able to recall an average of 
only 12% of the list, but correctly recog- 
nized 42%. Hollingworth (1913) found 
that the number of trials needed to reach 
the threshold of recall was larger than 
the number of trials needed to reach thie 
recognition threshold, and the ratio be- 
tween the two was larger for meaning- 
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ful than for meaningless material. Such 
data have led Andreas (1960, p. 447), 
Miler (1957), and others to conceptual- 
ize a hypothetical threshold of recog- 
nition which is lower than a hypothetical 
threshold of recall, so that the strength 
of a given memory trace may be below 
the recall threshold, but above the recog- 
nition threshold. 

It has long been known (Ebbinghaus, 
1913) that material which is overlearned 
will be retained better, or, more pre- 
cisely, that indicants of retention are 
not sensitive to early retention losses if 
the material has been overlearned with 
respect to the threshold of that indicant. 
Thus, when material has been sufficiently 
overlearned with respect to the recall 
criterion, recall scores may continue for 
long periods of time near the 100% 
correct level. We cannot demonstrate a 
classical retention curve by the recall 
measure for the words in the Lord’s 
Prayer or the National Anthem for most 
adult Americans. A somewhat analogous 
situation may occur when performance 
on easy recognition tests shows little or 
no decline for several days after associa- 
tions have been learned to the threshold 
of anticipation. This does not serve as 
a valid basis for contending, however, 
that recognition measures, per se, do not 
yield classical, that is, negatively ac- 
celerated, retention curves. The effect of 
the variable of overlearning upon the 
slope of recognition curves is apparent 
if we examine results from those in- 
vestigations in which the degree of over- 
learning was more limited. 

Luh also trained subjects to lesser 
criteria, such as 67% and 33% of the 
trials needed for the threshold of an- 
ticipation. He obtained recognition 
curves which showed significantly more 
loss during the early postlearning in- 
terval. 

Strong (1913) reported a negatively 
accelerated curve for recognition scores 
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in a situation in which training seems to 
have been interrupted near the threshold 
of recognition. His data also tend to 
support the present view that failure to 
obtain substantial decline of recognition 
scores in the immediate postlearning in- 
terval is a result of overlearning. A more 
precise interpretation of his curves is 
difficult, however, because training was 
not interrupted precisely at the threshold 
of recognition, and because additional 
variables were introduced by requiring 
his subjects to indicate various degrees 
of confidence concerning their recogni- 
tion responses. The negatively acceler- 
ated curve was obtained for the re- 
sponses made with the greatest degree of 
confidence. 

The generally accepted conclusion 
that performance on recognition tests 
is less affected by retroactive inhibition 
than performance on recall tests (Britt, 
1935; Hollingworth, 1913; Postman & 
Rau, 1957) is also limited by the above 
considerations. Overlearning makes ma- 
terial less vulnerable to interference 
effects, and therefore needs to be con- 
trolled for a valid comparison of such 
effects. The studies on which such com- 
parisons are based have generally em- 
ployed common rather than comparable 
criteria of original learning for the recall 
and recognition groups (Postman, 
1952). 


Easy or DIFFICULT RECOGNITION 
TESTS 


In discussing recognition measures 
Deese (1958) pointed out that the 
recognition scores depend upon various 
artifacts of testing: “The probability 
of a correct selection on the part of the S 
is at the mercy of the particular choices 
he has to make [p. 240].” Similarly, 
Underwood (1949) pointed out: 

If we required S to learn a list of adjectives 


and then place the adjectives among a group 
of nonsense syllables, S would probably show 
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very small loss in retention. Obviously the 
similarity of the test material to the other 
material is an important variable which deter- 
mines the recognition score [p. 512]. 


Everyone who has given multiple-choice 
tests of recognition in academic courses 
knows that one can manipulate the diffi- 
culty level of an item simply by varying 
the incorrect alternatives, The effect of 
degree of differentiation among correct 
and incorrect alternatives upon the 
threshold of recognition and upon recog- 
nition retention curves has not been in- 
vestigated systematically, but relevant 
data were reported by Lehmann (1888- 
89). He showed that net recognition 
scores for identification of gray colors 
declined from 87% correct to 17% cor- 
rect as the wrong stimuli presented to S 
became more similar to the correct ones. 
Similarly, Seward (1928) showed that 
frequency, speed, and confidence of cor- 
rect recognition responses depended 
upon the similarity of incorrect to cor- 
rect items. Postman, Jenkins, and Post- 
man (1948) showed that when one of 
the alternatives offered on a recognition 
test is more similar to the correct al- 
ternative, it receives a disproportionate 
share of the error choices. These data 
suggest that the number of trials needed 
to reach the threshold of a recognition 
test may be increased by increasing the 
similarity among the right and wrong 
alternatives of the test. Verbal material 
offers limited opportunities for manipu- 
lating the variable of similarity among 
responses along a continuum. Other 
symbolic material presents better oppor- 
tunities for this and is therefore more 
suitable for an investigation of this vari- 
able. In a study comparing retention 
curves for recall and recognition meas- 
ures (Bahrick & Bahrick, in press), dif- 
ferentiation among the correct an 

incorrect stimuli and responses In @ 
paired-associates learning task was var- 
ied. The paired-associate method of pres- 
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entation was used during training. Stimu- 
lus material consisted of 8 x 8 matrices in 
which 1 of the 64 cells was colored black, 
the remainder left blank, The responses 
to be matched with the stimuli were ran- 
domly chosen 3-digit numbers. One 
group of subjects was trained to the 
threshold of anticipation, that is, the 
first trial on which they correctly listed 
the 3-digit numbers paired with each of 
the matrices. The other two groups were 
trained, respectively, to an easy and to 
a difficult recognition threshold, as de- 
termined by two different recognition 
tests. In both the easy and the difficult 
recognition tests subjects were required 
to identify the correct matrix (stimulus) 
among five matrices presented in one 
row of the test blank, and to identify the 
associated number among five 3-digit 
numbers in the same row. In the easy 
recognition test the four wrong matrices 
differed from the correct one in that the 
stimulus position was displaced by a dis- 
tance of three cells; in the difficult recog- 
nition test the displacement was only one 
cell removed, Similarly, the easy recog- 
nition test provided incorrect response 
numbers which share none of the 3 
digits with the correct number, whereas 
the alternative response numbers for the 
difficult test shared 2 out of 3 digits with 
the correct number. The recognition 
tests were administered during each 
130-second intertrial interval. The same 
intertrial interval was used for the an- 
ticipation group, but this group took no 
recognition tests during the intervals, 
The mean number of trials required to 
reach the easy recognition threshold was 
8.0, the anticipation threshold 9.3, the 
difficult recognition threshold 11.5. The 
group differences were significant 
(p<.01). The data thus provided an 
instance in which a threshold of recogni- 
tion lies above the threshold of anticipa- 
tion for the same material. i 
In the Bahrick and Bahrick study 
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retention was measured after 2 hours, 
2 days, or 2 weeks by the same method 
used during the training of each group. 
All three groups showed negatively ac- 
celerated retention curves, and the per- 
centage correctly retained by the An- 
ticipation group was lower than for the 
Easy Recognition group, but higher than 
for the Difficult Recognition group. 
This was true for all three time intervals. 

This study clearly showed that thresh- 
olds for anticipation are not necessarily 
higher than for recognition, and the re- 
sults suggest that differences in the 
slopes of retention curves are determined 
by overlearning with respect to the 
threshold level, rather than by the re- 
tention measurement per se. The fol- 
lowing examination of artifacts produced 
by dichotomous scoring procedures 
makes this even more apparent. 


CHANGING SENSITIVITY OF 
DīcHoTomous Scorrs 


It has been demonstrated that the 
slope of learning curves based upon 
dichotomous scores is a predictable arti- 
fact of the changing sensitivity of the 
indicants at different stages of training 
(Bahrick et al., 1957). The slopes of 
such curves therefore can not provide a 
valid index of relative amounts of learn- 
ing at different stages of trajning. An 
analogous situation exists with respect 
to retention curves. Measures of an- 
ticipation, recall, and recognition are 
dichotomous indicants in that they tell 
us only which associations are above and 
which are below the threshold reflected 
by the response measure. A subject 
either recalls a nonsense syllable or he 
does not recall it. No further differentia- 
tion of associative strength is obtained 
with a recall score. The same is true for 
indicants of anticipation and recognition, 
though the particular threshold dividing 
correct from incorrect responses may be 
different for each of these indicants, and 
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in the case of recognition scores will 
depend upon the difñculty of the differ- 
entiation required by a particular test. 
The sensitivity of each of these measures 
to losses of associative strength changes 
in time. When material has been greatly 
overlearned with respect to a given meas- 
ure, so that most of the associations are 
far above threshold strength, the meas- 
ure is insensitive. This means that the 
strength of the associations may weaken 
considerably in time, but the slope of 
the curve based upon this measure will 
not show it because only a few of the 
originally, weakest associations will be 
weakened sufficiently to pass below the 
threshold. The slope of the curve does 
not directly reflect the amount by which 
associations are weakened in time. The 
slope of the curve simply reflects the 
number of associations passing below 
the threshold per unit of time. Assuming 
greatest frequency for associations 
which are originally of medium strength, 
the measure will become more sensitive 
as time passes and these associations are 
further weakened. Still later, when the 
bulk of the distribution has already 
passed below the threshold, the measure 
again loses sensitivity, and the curve 
flattens out. If the anticipation -and 
recognition thresholds for a particular 
task are widely separated, the time 
periods of maximal sensitivity for the 
respective curves will differ greatly, and 
as a consequence the slopes of the re- 


spective curves during a given time” 


period will be very different. This has 
been commonly observed and has led to 
the mistaken conclusion that recognition 
measures yield curves which differ, per 
se, from those obtained by means of 
free recall or anticipation measures. Tf 
threshold level and the degree of learning 
with respect to the threshold are com- 
parable, however, as is the case for the 
data reported by Bahrick and Bahrick, 
the slopes of the resulting anticipation 
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and recognition curves are comparable 
also. 

Retention curves based upon anticipa- 
tion, recognition, or free recall reflect the 
changing sensitivity of dichotomous 
measures. No general conclusions re- 
garding relative amounts of forgetting at 
different periods of time are permissible 
on the basis of such a curve. Such con- 
clusions are valid only if they are based 
upon equal-interval scales. A dichoto- 
mous measure based upon a low thresh- 
old might show maximal forgetting over 
a time period during which the curve 
based upon a higher threshold has al- 
ready approached an asymptote of zero. 

A completely comparable analysis has 
been made for learning curves based 
upon time-on-target scores and other 
dichotomous indicants (Bahrick et al., 
1957). The slopes of these curves could 
be predicted with good precision because 
evidence was available regarding empiri-- 
cal (near normal) error-amplitude distri- 
butions. The distribution of the strength 
of associations is not precisely known 
for any stage of forgetting, but it is 
probably not normal if learning is termi- 
nated on the ‘rst trial on which all 
associations are above threshold 
strength. Precise prediction of the 
slopes of retention curves based upon 
dichotomous measures will have to wait 
until empirical distributions of associa- 
tive strengths reflecting inter- and intra- 
individual differences are obtained for 
various types of material and for differ- 
ent degrees of original learning. 

It would follow from the preceding 
analysis, however, that the degree of 
homogeneity, as reflected by the com- 
bined inter- and intraindividual variance 
of the strength of various associations 
in a list, would partly determine the 
slope of retention curves based upon 
dichotomous indicants. The greater the 
variance, the fewer associations would 
pass through the threshold per unit of 
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time, and thus the less steep the slope 
of the retention curve. Conversely, if 
all associations in a list had exactly the 
same strength, that is, if the variance 
were zero, a step-function drop from 
100% to 0% retention would be pre- 
dicted for a curve of an individual sub- 
ject regardless of the rate at which as- 
sociations are actually weakened in time. 
The time of this drop would of course 
depend upon the height of the particular 
threshold. 

The strength of various associations 
within a list does not necessarily weaken 
equally in time. Rather, inter- and intra- 
individual variance may change as a 
function of differential interference 
processes, and in this way bring about 
curves of different slope for material of 
varying length, similarity, and meaning- 
fulness. The interpretation of these 
curves, however, must take into account 
the scoring artifacts which have been 
discussed. This analysis cannot be di- 
rectly applied to curves based upon sav- 


ings scores, the only nondichotomous 
indicants of retention. 


REFERENCES 


ACHILLES, E, M. Experimental studies in re- 
call and recognition. Arch. Psychol., 1920, 
6, No. 44. 

ANDREAS, B. G. Experimental psychology. New 
York: Wiley, 1960. 

Banrick, H. P., & Banricx, P. O. A re-exami- 
nation of interrelations among measures of 
retention. Quart. J. exp. Psychol., in press. 

Banrick, H. P., Frrts, P. M., & Bricos, G. E. 
Learning curves: Facts or artifacts? Psy- 
chol. Bull., 1957, 54, 256-268. 

Britt, S. H. Retroactive inhibition: A review 
of the literature. Psychol. Bull, 1935, 32, 
381-440. 


Crarxe, H. M. Recall and recognition for 
faces and names. J. appl. Psychol., 1934, 18, 
757-763. 

Deese, J. The psychology of learning. New 
York: McGraw-Hill, 1958. 

EspincHaus, H. Memory: A contribution to 
experimental psychology. (Trans. by H. A. 
Ruger & C. E. Bussenius) New York: 
Teachers College, Columbia University, 1913. 

Horiwcworty, H. L. Characteristic difference 
between recall and recognition. Amer. J. 
Psychol., 1913, 24, 533-544. 

LEHMANN, A. Ueber Wiedererkennen. [Con- 
cerning recognition.] Phil. Stud. (Wundt), 
1888-89, 5, 96-156. 

Luu, C. W. The conditions of retention. Psy- 
chol. Monogr., 1922, 31(3, Whole No. 142). 

MILER, ADRIENNE. Vergleich der Vergessenskur- 
ven fiir Reproduzieren und Wiedererkennen 
von sinnlosem Material. [Comparing reten- 
tion curves for recall and recognition of 
meaningless material.] Z. exp. angew. Psy- 
chol., 1957, 7, 29-38, 

Myers, G. C. A comparative study of recog- 
nition and recall. Psychol. Rev., 1914, 21, 
442-456. 

Postman, K. Retroactive inhibition in recall 
and recognition. J. exp. Psychol., 1952, 44, 
165-169. 

Postman, L., & Rau, L. Retention as a func- 
tion of the method of measurement. U. 
Calif, Publ. Psychol., 1957, 8, 217-270. 

Postman, L. W., JENKINS, O., & POSTMAN, 
D. L. An experimental comparison of active 
recall and recognition, Amer. J. Psychol, 
1948, 61, 511-520. 

Sewarp, G. H. Recognition time as a measure 
of confidence. Arch. Psychol., N.Y., 1928, 
No. 99. 

STALNAKER, J. M. Recognition and recall in a 
vocabulary test. J. gen. Psychol., 1935, 46, 
463-464. 

Srronc, E. K. The effect of time-interval upon 
recognition memory. Psychol. Rev., 1913, 
20, 339-372. 

Unverwoop, B. J. Experimental psychology. 
New York: Appleton-Century-Crofts, 1949. 


(Received September 7, 1962) 


Psychological Bulletin 
1964, Vol. 61, No. 3, 195-198 


NOTE ON MULTIVARIATE METHODS FOR 
PROFILE ANALYSIS* 


JOHN E. OVERALL 
Kansas State University 


Caution must be exercised in the use of the d statistic and related pro- 
cedures for the evaluation of profile data. The Pythagorean distance 
model requires comparable scale units and zero correlations among 
profile elements. A general model which can be employed appropriately 
with correlated multivariate data is described. The d statistic defined as 
the square root of the sum of squared differences between scores on 
separate profile elements in shown to be a special case requiring stringent 
assumptions if the resulting value is to be interpreted as representing 
interprofile distance in Euclidean space. 


In a recent issue of the Bulletin, Nun- 
nally (1962) has attempted to place 
profile analysis within “a mathematical 
and statistical framework.” While he 
demonstrated a very interesting relation- 
ship between distance measures of pro- 
file similarity (Osgood & Suci, 1952; 
Cronbach & Gleser, 1953) and results 
obtained from inverse factor analysis 
methods (Stephenson, 1953), his work 
lends sanction to a very specialized ap- 
proach which has in the past been em- 
phasized in the pages of this journal to 
the exclusion of more general ap- 
proaches. The purpose of this note is to 
recommend an alternative, more general 
model which has not been given adequate 
treatment by other writers. 

The distance between two points in 
multidimensional space can be defined 
according to the Pythagorean theorem 
as the square root of the sum of squared 
differences in projections upon uncor- 


related (comparable unit) reference 
axes. The d statistic, and hence Nun- 
nally’s “raw score” factor analysis, 1s 


ha geometric 
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square root of the sum of squared dif- 
ferences between corresponding scores 
in the two profiles. The conception of 
the d statistic as representing distance 
in multidimensional space thus implies 
that the profile elements are uncorrelated 
and are expressed in comparable units. 

The necessity for equivalence of scale 
units is easily recognized. Hardly any- 
one would fail to object to adding 
squared inches to squared feet or 
squared centimeters in obtaining a single 
distance index. In what units would the 
resulting distances be represented? The 
major problem is not simply that the 
scale of measurement will be arbitrary 
and unlike that of any of the original 
measures; the problem is that the arbi- 
trary units differ for different profile 
pairs depending upon the relative degree 
of difference between them on the several 
separate profile components. Moreover, 
we recognize that it is very difficult to 
justify the assumption that scale units 
are comparable in most areas of psycho- 
logical measurement. A solution which 
has frequently been employed is to trans- 
form all scales to standard deviation 
units. While this approach may not 
provide a completely satisfactory sol 
tion, it does introduce some meaningfu 
basis for assuming comparability. 


‘An even more difficult problem arises 
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from heterogeneous _intercorrelations 
among profile elements. Distance is de- 
fined as the square root of the sum of 
squared differences in projections on un- 
correlated reference axes. If correlations 
between profile elements exist, then the 
distance between two profiles in multi- 
dimensional measurement space is not 
the square root of the sum of squared 
differences in scores on the correlated 
profile components. If differential or 
heterogeneous intercorrelations exist, a 
very distorted picture of true distances 
between individuals may result from the 
simple summing of squared differences 
on the separate measures. 

Cronbach and Gleser (1953) have dis- 
cussed the problem of correlations among 
profile elements. They indicate that the 
d statistic computed over correlated pro- 
file elements is equivalent to a d statistic 
computed from the underlying orthog- 
onal factor variates, each weighted ac- 
cording to the portion of the total vari- 
ance accounted for by that factor. A 
major difficulty is that we seldom know 
the extent to which underlying orthog- 
onal components are reflected in the 
profile elements so that we do not know 
what is being weighted by how much. 
What is more, this uncertain weighting 
of orthogonal components is precisely 
like computing interprofile distances 
from measurements which, while uncor- 
related, are not of comparable units. 
Thus, we do not seem to avoid difficulty 
in relating the d statistic to the geomet- 
ric model when profile elements are not 
independent, If one wishes to weight 
factor dimensions differentially, the more 

precise approach would seem to involve 
computing the orthogonal components 
and then weighting them with full knowl- 
edge and freedom concerning choice of 


the weighting factors.” 
2 Jn later writings some of the investigators 
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Generalized Multivariate Approach to 
Profile Relationships 


An approach which appears somewhat 
more adequate than the d statistic, or 
the inverse factor analysis, is based upon 
a generalization of the distance function 
approach to the study of group differ- 
ences (Mahalanobis, 1936). Consider 
the multivariate distribution of p cor- 
related measurements over a heterogene- 
ous population of individuals. The 
squared distance between any two pro- 
files can be expressed in terms of a single 
linear combination of the multiple cor- 
related measurements in units of the 


population variance for that composite 
measure. 


Xaid;)* 


Laiajo;;" 


where d; is the difference between scores 
in the ith profile element for two individ- 
uals, a; is the ith compounding coeffi- 
cient, and o; is the covariance between 
the ith and jth profile elements. 

Taking partial derivative with respect 
to the a; and setting equal to zero re- 
sults in a set of simultaneous equations 
which can be solved to obtain values for 
the a; which will maximize the squared 
distance between the profiles relative to 
the population variance. The solution 
is represented in the following matrix 
equation which can be solved for rela- 
tive values of a;. 


a=C"d 


where C~ is the inverse of the variance- 
covariance matrix for the p profile ele- 
ments, and d is the vector of difference 
scores. 

The squared difference between pro- 
files in the correlated measurement space 
is then 


D?=0,d,+0gd2+. . .+Qpd,=d'C 1d, 


have become concerned with problems inher- 
ent in the approach (Cronbach, 1958). 


MULTIVARIATE METHODS FOR PROFILE ANALYSIS 


Obviously, if the profile elements are 
uncorrelated and have equal variance, 
the matrix C+ will be proportional to 
an identity matrix; for example, C= 
k-I. In this special case, D? is simply 
the sum of squared differences between 
the separate profile elements. 


Di=dCrd=dd= $ a. 
i= 

This is the special case with which the 
d statistic and Nunnally’s “raw score” 
factor analysis are concerned. When the 
conditions of equal variances and zero 
covariances are reasonably approached, 
the specialized solution may prove quite 
useful. On the other hand, it would 
appear important to be aware of con- 
ditions under which the less general ap- 
proach may prove misleading. 


Vector Product Approach to Profile 
Relationships 


It has been considered that the 
squared distance between two points in 
multidimensional space can be repre- 
sented as 


D*=d'C"\d. 


Recognizing that the vector d is the 
difference between two score vectors, we 
can also represent the distance between 
the two profiles in terms of their respec- 
tive distances from origin and the angle 
between the vectors. 

2 = (xi— 32) C7 (41-2) 

=x Ctr HaC ra 2410 Xe 


where «,C7x,=L,? and x/.C1x.=L,* 
represent the squared distances of the 
respective profiles from origin (i.e., the 
length of the vectors) and xC tt 
cosine ©: L, L, represents the cosine of 
angular separation times the product of 
the vector lengths. 

Considering a profile to be a point in 
multidimensional space, we recognize 
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that it can be defined completely by 
direction and distance from origin. 
Thus, the three characteristics of 
“level,” “variance,” and “pattern” (Os- 
good & Suci, 1952) are replaced by the 
two factors of “direction” and “dis- 
tance” from origin. 

In some problem areas, and psychiat- 
ric classification may prove to be one, 
direction from origin may be the only 
important basis for judging profile 
similarity. Patients are of a similar 
type when their profiles tend to be pro- 
portional irrespective of distance from 
origin. It is possible to eliminate dis- 
tance-from-origin as a variable in judg- 
ing profile similarity by simply scaling 
all vectors to unit length. This is ac- 
complished by dividing each profile ele- 
ment by the square root of x'iC7x;= L°. 


D?=4',Cx, tC 297, C“AXe 
=2(1—2',C-lx.) 
where x’,C'v, =2'-C-14.=1.0, 


Normalizing the profile vectors is equiv- 
alent to projecting all profile points to 
the surface of a unit radius hypersphere. 
The distances between points are then 
solely a function of the angles between 
the vectors. 

Tn the special case where profile ele- 
ments have equal variances and zero 
covariances (e.g., the matrix C~ is pro- 
portional to 7), the squared distance ex- 
pressed in terms of vector lengths and 
cosines is 


D*=3X1°4+3X.2—-23XiXs. 


This is where the raw cross-product 
analysis proposed by Nunnally comes 
into the general picture. val 
Normalizing profile vectors by divid- 
ing each element by the square root of 
the raw sum of squares for all elements 
within the profile results in 
pt=2(1-2XiX2); 
where 3%, =3X=1.0. 
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Thus, where it can be assumed that pro- 
file elements are uncorrelated and have 
equal variance, normalizing profile vec- 
tors results in D? values which are a 
function of the raw cross-product term 
only. 


Factor Analysis Approach to Profile 
Relationships 


Direct evaluation of D*=d'C-d may 
prove to be computationally impractical 
when the number of profiles, or the num- 
ber of elements within profiles, is large. 
It is possible to transform any set of p 
correlated measurements into q&p 
mutually uncorrelated variates having 
unit variance (Rao, 1952, p, 345). Any 
method of orthogonal factor analysis can 
be employed to accomplish this purpose 
provided that the total variances of the 
profile elements are included in the ma- 
trix which is factored (Overall, 1962). 
For the uncorrelated, unit variance fac- 
tor variates, the variance-covariance 
matrix will be an identity, and D? can 
be obtained as the sum of squared dif- 


ferences in scores on the uncorrelated 
factor variates, 


D=d'C*d=d'd=de+d2+. . +d, 


where d is the difference vector for un- 
correlated, unit variance factor scores, 

In some instances we may be con- 
cerned with the “psychological” distance 
between objects or persons which may be 
different from the value obtained by 
giving equal weight to the orthogonal 
factor variates. For example, we may 
conceive that the psychological judg- 
ment of distance is based upon differ- 
ential perceptual weighting of various 
factor dimensions. A weighted distance 
function may be desired. 


D?=a,d,?+ a,d.?+ oe bard? 


where D? is the square of the perceived 
psychological distance between two ob- 
jects, the d;* are squared distances along 
independent factor dimensions, and the 
a; are perceptual weighting factors. 

If we have global distance judgments 
and simple distances between a number 
of pairs of objects on uncorrelated factor 
dimensions, we can form a set of n equa- 
tions in 7 unknown and solve for the 7 
values of a; which will minimize error in 
accounting for “psychological” distance 
in terms of the simple factor dimensions. 
This would seem to be a more precise 
solution to the problem of perceptual 
weighting of independent factor dimen- 
sions than is afforded by the d statistic, 
which weights independent factor dimen- 
sions according to the arbitrary degree 
of representation of the factor dimen- 
sions in the profile. 
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COLOR AS AN INDEPENDENT VARIABLE IN 
PERCEPTUAL RESEARCH 
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A summary of the literature in which color or its characteristics (hue 
luminance, saturation) have been treated as an independent variable 
with dependent variables of apparent distance, apparent size, apparent 
weight, apparent temperature, and visual acuity. Luminance appears 
to be the major cue to apparent size, distance, or weight. Hue is the 
more important cue to apparent temperature. A chromatic figure seen 
against an achromatic background produces best acuity. Implications of 


this literature are discussed. 


Historically, visual perception, par- 
ticularly space perception, has been de- 
scribed in terms of stimulus cues. These 
have been treated as independent varia- 
bles in experiments in which judgments 
of size, distance, or acuity were the 
dependent variables (Bartley, 1958; 
Dember, 1960; Gibson, 1950; Graham, 
1951). Color perception typically has 
been treated as a dependent variable, 
although studies of the cues of aerial 
perspective or light and shade might 
be considered to include certain char- 
acteristics of color. Koffka (1935) has 
pointed out that “the different aspects 
of our visual world, size, shape, colour, 
orientation, and localization, are con- 
stituted in a thoroughgoing mutual inter- 
dependence [p. 265]. This suggests that 
color perception should also be investi- 
gated as a stimulus cue in order to deter- 
mine its influence on perception. The 
present paper summarizes the literature 
in which color has been treated as an in- 
dependent variable in regard to depend- 
ent variables of apparent distance, ap- 
parent size, apparent weight, apparent 
temperature, and visual acuity. 


EFFECTS ON APPARENT DISTANCE 


Two colored stimuli adjacent to each 
other may apper to lie in different planes. 
Three explanations have been offered 


for this phenomenon: (a) Chromatic 
aberration, in the refractive media of 
the eye radiation of short wavelengths 
(ie., producing blue), focuses images 
within the eye closer to the lens than 
radiation of long wavelengths (i.e., 
producing red). The same sort of effect 
appears for distance, as radiation from 
distant targets projects images closer to 
the lens than does radiation from nearby 
targets. This forms a possible basis for 
learning to discriminate (Bartley, 1958, 
p. 165). Thus, a red object would appear 
to be closer than a blue object of the 
same physical size and physical distance. 
(b) Eindringlichkeit—Katz (1935) be- 
lieved that colors vary in a quality which 
he called Eindringlichkeit (translated 
“insistency” after Titchener, 1916) and 
that this quality affects the apparent 
distance to which the color will be 
referred. Koffka (1935, p. 113) and 
Titchener (1916, p. 55) both pointed 
out that insistency varies with intensity 
as well as with other qualities (eg. 
whiteness for Koffka, clearness conjoin 
with quality, or with intensity, or with 
quality and intensity together for 
Titchener). (c) More recently, a third 
explanation has been offered (Smith, wW. 
M., as quoted in Ittelson, 1960, P. 101). 
This suggests that the more a color 
approximates the background (either in 
hue, luminance, saturation, or a com- 
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bination) the more it appears to recede 
towards the background. The more a 
color differs from the background, the 
farther it stands out from the back- 
ground. Thus, of two colored stimulus 
objects, the one most like the background 
would appear to be more distant than 
the other. 

Most studies in the literature (Bel- 
aiew-Exemplarsky, 1925; Johns & Sum- 
ner, 1948; Katz, 1935; Leventhal, 1953; 
Luckiesh, 1918; Mount, Case, Sander- 
son, & Brenner, 1956) reported objects 
of longer wavelengths to be perceived 
as nearer than objects of shorter wave- 
lengths. Zajac (1953) reported the same 
finding when monocular distance judg- 
ments are made through a colored filter. 
Zajac took this to support the theory of 
chromatic aberration as a cue to depth. 

Pillsbury and Schaeffer (1937) found 
results which contradict the usual find- 
ing. Using red and blue lights, equated 
in intensity as well as possible, and 
compensating size of the lights at differ- 
ent distances to give the same retinal 
image regardless of distance they found 
that 11 of 15 observers reported blue to 
be nearer than red although the red 
light was actually nearer. All of the 
observers mentioned luminance of the 
color that they judged to be nearer as 
the reason for that judgment, If the 
blue light actually was of higher lumi- 
nance than the red light then Pillsbury 
and Schaeffer’s results are understand- 
able, as colors of higher luminance 
appear nearer than colors of lower 
Juminance. 

Early studies of color effects did not 
equate stimulus objects in intensity. 
Mount et al. (1956) overcame this objec- 
tion. A standard stimulus, either a 

bright gray or a dark gray square, was 
placed 200 feet in front of the observer. 
Four hues (red, yellow, green, and blue) 
and four grays, chosen so that one of 
the grays matched one of the hues in 
luminance, were used as comparison 


stimuli. Each hue was perceived as 


closer than its corresponding gray. The 
magnitude of this effect was 3 feet or 
1.5% of the standard distance. All of 
the comparison stimuli, both gray and 
hued, appeared closer when compared 
with the darker standard stimulus than 
when compared with the lighter one. 
This contradicts results obtained by 
Edwards (1955) who reported failure 
to find consistent evidence that some 
colors are drawn nearer to the observer 
than others on the Howard-Dohlman 
apparatus when they are compared with 
a gray standard. Mount and associates’ 
study indicated that hue does have an 
effect upon apparent distance, but the 
experimental design did not permit an 
estimate of the difference in effect among 
hues. 

Luminance appears to be a major cue 
to distance regardless of hue effects 
(Ames, 1946; Ashley, 1898; Pfeiffer, 
1937). Vernon (1952, p. 105) reported 
that, in general, of two light boxes of 
different luminance, the higher lumi- 
nance appeared nearer, although the 
ratio of apparent luminances varied at 
different distances of the two boxes. 
Taylor and Sumner (1945) placed col- 
ored papers on the movable pole of the 
Howard-Dohlman apparatus. The ob- 
servers equated it in distance with a gray 
stationary pole. All colors were seen 
behind the gray pole. Luminance seemed 
to provide the major cue, as a rank-order 
correlation of .99 was obtained between 
luminance measures of the colors and 
their average distances from the station- 
ary pole. Johns and Sumner (1948) 
verified this result using the same appa- 
ratus but placing the colors on the sta- 
tionary pole and the gray on the movable 
pole. Luminance correlated .66 with 
distances of colors from the observer’s 
eyes. In these studies, colors of higher 
luminance appeared nearer than colors of 
lower luminance at a constant distance. 

Coules (1955) investigated the rela- 
tionship between luminance and appar- 
ent distance further by varying the 
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luminance ratios between a stationary 
standard stimulus and a movable vari- 
able stimulus. The variable stimulus at 
each distance presented a constant size 
to the observer’s eyes. When the variable 
stimulus was of higher luminance than 
the standard stimulus, regardless of its 
distance from the standard, more judg- 
ments of the variable’s appearing nearer 
were made under monocular conditions 
than under binocular. This suggests 
that the cue of luminance affects distance 
judgments independently of binocluar 
cues. When the two stimuli were of the 
same luminance, the frequency of re- 
sponses, indicating the variable stimulus 
to be nearer than the standard, did not 
vary as the variable was brought nearer. 
Under both monocular and binocular 
conditions, the percentage frequency of 
responses, of the variable stimulus’ ap- 
pearing nearer than the standard, in- 
creased as the logarithm of the ratio be- 
tween the two ]uminances was increased. 
In a second experiment, Coules verified 
this relationship at three absolute lumi- 
nance levels. 


Errects ON APPARENT SIZE 


Warden and Flynn (1926) had ob- 
servers rank eight cartons covered wi 
colored Hering papers according to 
apparent size. They found no effects on 
apparent size due to color, although 
they reported an effect due to specific 
serial arrangement of the colors while 
the observer was ranking them. Gund- 
lach and Macoubrey (1931) replicated 
this study using cartons covered with 
Milton Bradley papers. Regardless of 
arrangement, these authors found a sig- 
nificant effect of color on apparent size. 
To test this further they took paired- 
comparison data from the same observ- 
ers on the same stimlui. Correlations of 
ranks from the first part of the experi- 
ment with those from the paired-com- 
parison data was .93. 

Bevan and Dukes (1953) tested color 


effects by mounting rectangular stimulus 
cards of four different sizes and four 
different colors in a natural setting at dis- 
tances of 18-52 feet from the observer. 
The observer selected a card the same 
size as the stimulus card from among 14 
neutral gray reference cards. Sizes of red 
and yellow stimuli were overestimated 
and were different from size estimates of 
blue and green stimuli which showed no 
reliable difference between estimated size 
and objective size. No significant dif- 
ference was found between size estimates 
of red and yellow stimuli nor size esti- 
mates between blue and green stimuli. 
Deviations of estimated size from actual 
size varied with objective size but the 
interaction between objective size and 
color was not significant. Unfortunately, 
the authors did not test for the inter- 
action between distance and color in 
evaluating effects on size. 

The studies described above and 
several other studies -(Franklin, 1956; 
Kanicheva, 1938, 1939; Mitra & Datta, 
1939; Sato, 1955) have reported color 
to have an effect on apparent size. To 
date, no one has yet identified the effect 
of each dimension of color (hue, lumi- 
nance, or saturation); although studies 
by Gundlach and Macoubrey (1931), 
Sato (1955), and Wallis (1935) have 
shown that luminance is significantly 
correlated with apparent size, objects of 
higher luminance being overestimated in 
size. Holway and Boring (1940) verified 
this with noncolored stimuli. Robinson 
(1954) found the increase in size due to 
luminance to be of the order of 3-4% 
under either monocular or binocular 
conditions. Increasing the absolute level 
of luminance of two stimuli decrease! 
overestimation of the stimulus of higher 
luminance. 

Using two levels of luminance (25 and 
025 foot-lamberts), Chalmers ( 1953) 
showed that variability of judgments 
was not increased in either monocular or 
binocular judgments as luminance mA 
reduced. The author described the effect 
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of reduced luminance as a regression to 
monocular vision, as at the lower lumi- 
nance level binocular judgments of size 
seemed to follow the visual angle when 
the standard stimulus (isosceles triangle) 
was presented at seven different dis- 
tances from 10 to 120 feet. Visual angle 
of the standard at each of these distances 
was the same. Visual angle is the only 
cue for monocular perception of the 
size of objects beyond 10 feet when 
secondary cues are removed as they were 
in this study. 

As is the case with apparent distance, 
luminance appears to be the major cue 
to apparent size. There are no studies 
in the literature of the effects on apparent 


size of varying the hue under constant 
luminance conditions, 


EFFECTS ON APPARENT WrIcut 


In an early study, Bullough (1907) 
reported that objects of different colors 


appear to be of different weights. 
DeCamp ( 


1917) using paired compari- 
sons found a slight tendency to judge a 
red or black block to be heavier than a 
yellow or blue block of the same weight 
when each colored block was compared 
with a gray block of the same size. Order 
of apparent weight from heavier to 
lighter was red, white, orange, violet, 
green, purple, blue, an 


d yellow-green, 
A set of order of merit data in which 


observers simply lifted the blocks and 
placed them in order of apparent Weight 
resulted in the order: red, yellow, green 
and blue. Large individual and sex differ- 
ences appeared in this study. Kimura 
(1950) found similar orders in apparent 
weight when an order of merit procedure 
was used and when a paired-comparison 
method was used. The order of apparent 
heaviness in his data was black, blue, 
red, violet, orange, green, yellow, and 
white. 
Warden and Flynn (1926) found that 
effects of color on apparent weight were 
greater than effects on apparent size. 


The order of apparent weight (from 
heavier to lighter) which appeared in 
their data was black, red, purple, gray, 
blue, green, yellow, and white. 

Koch (1928) used Milton Bradley 
standard colors presented at opposite 
ends of a fulcrum. The observer was 
asked to balance them. He found no 
consistent order of apparent weight for 
his observers considered as a group. 
Each individual, however, seemed to 
show a consistent order of apparent 
weights as a function of color. 

Monroe (1925) reported that appar- 
ent weight varied inversely with lumi- 
nance. Taylor (1930) comparing “look- 
ing” and “hefting” procedures found this 
to be true, if colored objects identical 
in size and shape were observed, but did 
not find this to be true if the objects 
were hefted. Intersubject reliability was 
higher with the looking procedure than 
it was with the hefting procedure, Rank- 
ings obtained by an order of merit 
procedure were similar to those obtained 
by paired comparisons. 

None of the effects reported in studies 
conducted before 1950 was evaluated 
statistically. For this reason, Payne 
(1958, 1961) carried out a series of stud- 
ies to determine effects of color on 


apparent weight by a looking procedure. 
The first study employed paired com- 
parisons. 


Median rank of apparent 
weight for each color was compared 
among observers by a coefficient of con- 
cordance which showed apparent weight 
as a function of color to be significant at 
the .01 level for three sizes of stimuli. 
Order of apparent weight was (heavier 
to lighter) blue, red, yellow, reddish- 
purple, turquoise, and green, Rank-order 
correlations between the median ranks 
of apparent weight for cues of different 
sizes showed the ranks of apparent 
weight to be highly correlated among 
stimulus sizes, A rank-difference correla- 
tion coefficient, Computed between me- 
dian ranks of apparent weight for each 
color and the rank of reflectance of that 
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color, showed that the higher the reflec- 
tance the lighter the stimulus appeared 
(p=-.94). A second experiment, using 
the method of single stimuli, verified con- 
clusions from the paired-comparison 
data. No significant relationship was 
shown between order of apparent weight 
and preference for colors. 

The single stimuli procedure was re- 
peated under controlled illumination con- 
ditions with 2-inch cubes covered with 
Munsell papers matched in luminance. 
Analysis of variance showed hues to be 
significant at the 01 level. However, 
yellow, green, and gray did not differ 
significantly from one another in their 
effect on apparent weight, nor did blue, 
red, or purple differ significantly from 
one another in their effect, although 
apparent weight of the latter group of 
hues did differ significantly from that of 
the former. This, hue does not seem to 
be as strong a cue to apparent weight as 
luminance. 

Wright (1962) verifed the effect of 
luminance using West German observers. 
He used an Osgood semantic differential 
technique. Each observer judged a single 
3-inch square surface color on each of 
24 adjective-pair rating forms. Forty- 
five chromatic colors, black, white, and 
gray, selected so as to be relatively un- 
correlated with each other on the color 
dimensions of hue, luminance, and satu- 
ration, were rated in this manner. Use 
of German observers eliminated the con- 
fusion among English-speaking observers 
about the meaning of “Jightness” (e-g-, 
does it mean light-heavy oT light-dark 
ratings?). Analysis of the partial regres- 
sion of average leicht/schwer ratings on 
hue, luminance, and saturation showed 
a well-defined luminance effect and a less 
well-defined saturation and hue effect. 


EFFECTS ON APPARENT ‘TEMPERATURE 


Writers in the early literature (Bul- 
lough, 1908; Lewinski, 1938; Newhall, 
1941; Ross, 1938; Stefanescu-Grainger, 
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1911; Tinker, 1938) are in agreement 
that, as hue moves from red to blue, 
apparent warmth moves from warm to 
cold. Morgensen and English (1926) 
asked observers to judge the temperature 
of heating coils wrapped in paper of 
different colors. The coils were in fact 
of equal temperature. Observers judged 
green and blue coils to be less cool than 
purple coils. Red, orange, and yellow 
fell between the two extremes. No 
statistical tests were presented. 

Kimura (1950) using both an order of 
merit and a paired-comparison method 
found the decreasing order of apparent 
warmth to be: red, orange, yellow, violet, 
green, black, blue, and white. 

Berry (1960) conducted an experi- 
ment in which the observer was led to 
believe that effects of colored illumina- 
tion upon tracking performance was be- 
ing studied. The observer, who drove an 
American Automobile Association auto 
trainer, was instructed to push a switch 
when the temperature in the room rose 
to a point at which he felt uncomfortably 
warm. ‘Temperature was gradually 
changed, but the changes were the same 
for each of five colors of diffuse illumina- 
tion: green, blue, yellow, amber, and 
white. Luminance of each illuminant 
was the same. There were no significant 
differences in the Discomfort Index 
(based on wet- and dry-bulb temper- 
atures at the moment when the observer 
indicated the onset of discomfort) be- 
tween the colors of illumination. At the 
close of the test, observers were asked 
to rank the five colors according to the 
amount of heat they transmitted. These 
rankings were significant (p < 01). 
Green and blue were ranked almost 
identically as the coolest, white was at 
an intermediate position, and yellow and 
amber were ranked almost identically 
as warmer. Thus, observers persisted in 
the conventional belief that green and 
blue are “cool” colors when asked to 
rank colors, although they did not expe 
rience them any differently from other 
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colors in the levels of heat the observer 
would tolerate. 

Wright (1962) analyzed his semantic 

differential data for the polar adjectives, 
kalt/warm. He found that hue, lumi- 
nance, and saturation accounted for at 
least half of the consistent variation 
while for apparent weight they account 
for about two thirds. There is a well- 
defined hue effect on apparent warmth 
independent of luminance or saturation. 
As reported in the early literature, as 
hue was moved from red to blue appar- 
ent warmth moved from warm to kalt. 
Effects of luminance and saturation were 
less definitive than those for hue. How- 
ever, increasing warmth ratings did cor- 
respond with increasing saturation. In 
judgments of apparent temperature hue 
appears to be the major cue. 


ErrEcTS on Visuat Acurtry 


The effects of stimulus color on acuity 
may vary according to the relations 
between the test object (stimulus) and 
its surround or background. The present 
paper will describe only studies in which 
a stationary stimulus figure seen against 
a flat background was employed. 

If colored illumination is used, both 
the test object and its surround are 
affected. Under these conditions Koffka 
and Harrower (1932) found acuity 
values to be higher for targets seen as 
red, white, or yellow than they were for 
targets seen as blue or black. Ferree and 

Rand (1931, 1932), using white broken 
circle test objects, found acuity to vary 
with the color at both low (.075 foot- 
candle) and higher (.3116 foot-candle) 
illumination levels. With luminance kept 
constant the order of merit of acuity 
for hues (in millimicrons) was: red 
(666), orange (624), green (522), blue- 
green (500), and blue (ABa): Tar ee 
lower illumination. At the higher illu- 
mination value the order was yellow, 
yellow-green, orange, green, anal ten: 
Blue and blue-green were not tested at 


this intensity. These orders of colors 
were the same whether saturation varied 
or was constant among the colors. How- 
ever, differences in acuity for different 
hues were less when equal saturations 
were used. Jonnard, Faillie, and Vial 
(1937), using Landolt rings, verified the 
finding that yellow illumination as com- 
pared with white illumination in differ- 
ent contrasts improved acuity of a light 
object on a dark ground. However, they 
found that it diminished acuity with a 
dark test object on a light ground. 

When the amount of illumination is 
varied, visual acuity increases monotoni- 
cally with illumination provided the 
illumination affects both the test object 
and its background so as to keep their 
luminance contrast constant (Konig, 
1897; Shlaer, 1938). Weinberg and 
Lapinskaya (1941) have established an 
equation which permits one to calculate 
visual acuity as a function of illumina- 
tion for monochromatic lights and their 
mixtures while taking account of the 
Purkinje phenomenon. The authors re- 
ported the effects of the Purkinje phe- 
nomenon on acuity to be much greater 
than its effects on luminance. 

Acuity is better if the stimulus or test 
object is colored and is seen against an 
achromatic background than it is with 
achromatic stimuli and backgrounds. 
Schwarz (1951) found acuity for 
Landolt rings seen against a dark back- 
ground to be best in the central region 
of the visual spectrum (i.e., yellows). 
Berbert (1958) verified this using two 
self-luminous dots seen against a dark 
field to test acuity. The dots were 
centered at 450, 500, 550, 600, or 650 
millimicrons in hue. Observers were 
dark adapted before the experiment. 
Each hue was tested through the whole 
available range of luminances (3-decibel 
steps) during each session. Only one hue 
was used in one session. Best acuity, 
that is, lowest angular subtense of gap 
between dots when they appeared to be 
just touching, occurred at slightly dif- 
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ferent luminance values for each hue. 
However, the best acuity occurred at 
about the same increment above its cor- 
responding operational threshold for 
each hue. The midregion of the visible 
spectrum (500-600 millimicrons) pro- 
duced better acuity than did the ex- 
tremes. In each observer the best acuity 
measured in the midregion of the spec- 
trum was superior to acuity measured 
with an achromatic stimulus. A supple- 
mentary experiment in which measure- 
ments were made with all colors and 
luminances at each session confirmed 
these conclusions. 

MacAdam (Stiles, 1947) reported that 
in Landolt rings, in which acuity meas- 
urements depend on discrimination in 
small areas, color contrast between a 
ring of one color and a background of 
another has much less effect than lumi- 
nance contrast. Using colored rings 
mounted on an equiluminous gray back- 
ground he (MacAdam, 1949) found 
colors of rings—expressed on the CIE 
(Commission Internationale d’Eclair- 
age) diagram—whose orientations were 
reported correctly as frequently as those 
of equally large gray rings that were 
lighter than the same gray background. 
Using these figures as measures of chro- 
matic contrast MacAdam concluded 
that: 

When the target and background are not equi- 
luminous, the acuity is the same as that pro- 
duced by an achromatic contrast equal to the 
square root of the sum of squares of the 
luminous contrast and the chromatic contrast 


[p. 172]. 


Luminance has been shown to affect 
visual acuity (Berbert, 1958; Berger, 
1941, 1942; Konig, 1897; Wilcox, 
1936), but the shape of the mathe- 
matical function which describes this 
relationship has been questioned. Wilcox 
(1936) reported that, for a self-illumi- 
nated stimulus (parallel bars) seen 
against a dark background, increasing 
the illumination of the stimulus first low- 


ered and then raised acuity with a dis- 


tinct minimum value at medium intensi- 
ties, that is, “best acuity” at these in- 
tensities. With a dark stimulus seen 
against an illuminated background, in- 
creasing the intensity of the background 
resulted in a continuously increasing 
acuity, but the rate of increase became 
progressively less as intensity increased. 
Wilcox interpreted his results in terms 
of irradiation effects which alter the 
apparent area of the parallel bars at 
higher illuminations. Since the minimum 
separation between the bars at which a 
gap was seen was used as a measure of 
acuity in this study, a change in ap- 
parent width of the bars would change 
the estimate of acuity, 

Berger (1942) found results with self- 
illuminated Landolt rings or with points 
observed on an almost black background, 
which contradicted those of Wilcox, In 
both points and broken circle, the mini- 
mum visual angle decreased continuously 
with illumination, rather than increased 
with higher illuminations as occurred in 
Wilcox’ study. This was true with either 
right or left eye or with both eyes. There 
were large individual differences in the 
absolute values obtained from the vari- 
ous subjects. Berger found smaller 
values with points as stimuli than he 
did with Landolt rings. Thus, he felt 
that most conventional measures of 
visual acuity should not be considered to 
measure “minimum resolving power of 
the eye,” but that acuity and minimum 
resolving power should be considered 
apart from one another. He attributed 
the difference in his findings from those 
of Wilcox to be due to glare at the higher 
illumination levels in Wilcox’ study. 

Berbert (1958) attempted to resolve 
the contradictions between the results of 
Wilcox and Berger. Using bright dots 
ona dark field as stimuli he found visual 
acuity first rose to a maximum with in- 
creasing test object luminance and then 
fell off. Maximum acuity for the dicts 
occurred near 3.2 millilamberts or at 10 


times the luminance that maximum 
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acuity occurred in Wilcox’ study. The 
difference in luminance, for maximum 
acuity when points instead of bars are 
used, is probably due to the considerable 
difference in subtense of these test ob- 
jects. 


Discussion 


The major cue to apparent size, dis- 
tance, and weight appears to be lumi- 
nance. Lighter stimulus objects appear 
to be larger, nearer, and lighter in weight 
than darker stimulus objects. This char- 
acteristic appears to function regard- 
less of hue of the stimulus object. On 
the other hand, hue is the more impor- 
tant cue to apparent temperature. Best 
acuity seems to occur with a chromatic 
figure seen against an achromatic back- 
ground. There are few data available on 
the interactions between hue and lumi- 
nance, 

The finding that luminance is the 
more important cue to apparent size, 
distance, and weight leads us to seek a 
common functional relationship for these 
three dependent variables. Interaction 
between apparent size and apparent dis- 
tance are well known (see Osgood, 1953, 
pp. 261-271). The fact that lighter ob- 
jects appear to be nearer as well as 
larger than darker objects of the same 
physical size seen at the same distance is 
simply an example of this interaction, 
In a similar manner, objects of higher 
luminance appear both larger and lighter 
than objects of the same size and shape 
which are of lower luminance. Experi- 
ments on the size-weight illusion 
(Crutchfield, Woodworth, & Albrecht, 
1955) have shown that larger objects of 

the same objective weight as smaller 
objects seem to be substantially lighter 
in weight than the smaller objects. Thus, 
we would expect influence of luminance 
on apparent weight from the ha ago 
influence of amane on apparent size 
P size-weight illusion. 

E“ (1921) introduced the concept 
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of figure-ground into the study of per- 
ception. It might be argued that any 
characteristic which made a stimulus 
object stand out from its background, 
that is, made it more “figurelike,” would 
make that stimulus object appear nearer 
that another stimulus object which dif- 
fered from it only in that characteristic. 
Katz (1935) would explain the varia- 
tion of apparent distance (and thus of 
apparent size and weight) with lumi- 
nance on this basis by pointing out that 
Eindringlichkeit varies with intensity. 
Eindringlichkeit is considered to be an 
identifiable stimulus quality. Thus, an 
object of greater luminance would have 
greater Eindringlichkeit than one of 
lesser luminance and would appear 
nearer. 

Gibson and Waddell (1952) have 
pointed out that an image becomes dif- 
ferentiated by means of “gradients of 
luminous intensity” which serve as 
transitions between relatively dark and 
relatively light regions of the visual field. 
A fully differentiated image is one in 
which the gradients of intensity are as 
steep as possible, Under these condi- 
tions the image would appear as a 
sharply defined figure against a back- 
ground. At the other extreme, if the 
gradients of luminous intensity are 
zero, then the phenomenal experience 
is that of a uniform or homogeneous 
visual field or a fully undifferentiated 
image. Under these conditions there 
would be no figure at all. Between these 
extremes the gradient of intensity should 
determine how clearly a stimulus is per- 
ceived as a figure against a background. 
The less the intensity difference between 
figure and ground the less the figure 
would be differentiated and the more 
likely it is to be seen at the same distance 
as the background. The greater the 
difference between the luminance of two 
figures seen against a common back- 
ground the farther apart the two figures 
should appear, the brighter one appear- 
ing closer than the darker one. Smith 
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(Ittelson, 1960, p. 101) supported this 
argument and generalized it to apply to 
gradients of hue and saturation as well 
as luminance. 

Future research could well examine 
the interactions among gradients of hue, 
luminance, and saturation. When we can 
identify the extent to which these stimu- 
lus dimensions function together as per- 
ceptual cues we shall have a sounder 
basis for trying to discover ow they 
came to serve as cues. 
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As a physiological measure of emotii 
neglected in recent years. This revie' 
stimuli produce replicable changes in 
influence blood pressure reactivity 


between personality and hi 


cates that the Ist 2 questions can be answered 
thodological problems yet to be resolved. No 
for the 3rd question and tangential studies 


there are a number of mei 
direct evidence is available 
offer conflicting results. 


The use of measures of blood pressure, 
as dependent and intervening variables 
in psychological research, has a long and 
exciting history dating back before the 
turn of the century. But psychologists 
do not seem to have used the measure 


n individual’s blood pressure to rep- 


of a 
resent “emotional” changes as much as 


they have used other measures, for ex- 


ample, the galvanic skin response. One 
jew is to encourage a 


aim of this rev! 
fresh look at the meaning, measures, and 
results of studies of blood pressure. 

The design of this review can be sum- 
marized in the following way- The first 
studies to be considered are pertinent to 
this question: Are there replicable 
changes in blood pressure when subjects 
are presented with certain classes of ex- 
ternal stimuli? Secondly, are there char- 


the senior author 
University of M 
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of this paper. 


ichigan. APP 
“ae f the University of 


n earlier version 


on, blood pressure has been relatively 
„w considers 3 questions: Do certain 
blood pressure? Do cognitive states 
? Is there a causative relationship 
gh blood pressure? Available research indi- 


affirmatively, though 


acteristic ways of responding to a given 
external stimulus as a function of certain 
kinds of cognitive states or processes? 
The review concludes by presenting 
studies which attempt to explore the 
major question: Do various psycho- 
logical states have possible causative re- 
lationships with levels of sustained high 
blood pressure? 

Perhaps the first review of the litera- 
ture relating psychological and physio- 
logical variables was that by Angell and 
Thompson (1899) which together with 
that of Robbins (1919) discussed vari- 
ous psychological variables as they re- 
lated to subjects’ reactions on a ple- 
thysmograph. The most comprehensive 
review was done in 1929 by Darrow. 
Recent reviews, which consider limited 
portions of the literature relating psy- 
chological and physiological variables, 
are those by Moses, Daniels, and Nicker- 
son (1956) and Shapiro (1960). Also 
available is an annotated bibliography © 


blood pressure research from 1920 to 
209 
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1950 (Koller & Katz, 1952). Because 
hypertension is more frequently found 
in Negroes than in Caucasians, there is 
a separate literature which has been 
reviewed by Schulze and Schwab (1936) 
and Lennard and Glock (1957). The 
present attempt is to provide a compre- 
hensive review of that literature which 
is specifically relevant to persons inter- 
ested in relationships between psycho- 
logical and blood pressure variables, 


EMOTION-PRODUCING STIMULI AND 
BLOOD PRESSURE RESPONSES 


Techniques for nonsurgical measure- 
ment of human arterial blood pressure 
were first described in 1896 by Riva 
Rocci (according to Bard, 1956). Given 
an instrument for easy and painless 
measurement, a variety of pressors was 
ingeniously conceived to produce changes 
in blood pressure, By “pressor” is 
meant any externally or internally ap- 
plied stimulus expected to induce a blood 
pressure response, By way of introduc- 
tion to the vast amount of literature 
in this general area, it would be useful 
to illustrate the sensitivity of blood pres- 
sure to various stimuli. 

The first published research on blood 
pressure in American psychology is that 
by Marston (1917), a Massachusetts 
lawyer, who wanted to demonstrate the 
validity of measures of blood pressure 
change for detecting when a person 
was lying. His subjects were asked to 
tell either the “truth” or a “lie,” em- 

bellished around one or two facts sup- 
plied by the experimenter. Subjects’ 
introspections were used to show that 
blood pressure increased when the sub- 
5 began to attempt to deceive and 
jects h they told the truth. 
decreased when he ob- 
it did not matter what the ol 
Actually, 1 - so long as the sub- 
jective facts were; s we 
J hey were deceiving or 
jects thought they V 
: Jling the truth their blood a 
elli e 
were affected. Langfeld repeated t 


experiment in 1920 and obtained the 
same results. 

Even dreams and music have been as- 
sociated with changes in blood pressure. 
MacWilliams (1923) found that blood 
pressure might rise while subjects slept 
and that this elevation was associated 
with subjects reporting (on being awak- 
ened) that they had been dreaming. 
Lovell (1941) showed that blood pres- 
sure went up following a loud, unex- 
pected sound. Steinman, Jaggi, and 
Widmer (1955) played subjects record- 
ings of Mozart and Beethoven and pro- 
duced a decline in blood pressure. When 
the subjects listened to jazz and Jap- 
anese music, blood pressure went up. 
The literature abounds with results 
which indicate that blood pressure 
changes may be associated with many 
forms of psychological stimuli or stresses. 


Psychological Stress and Temporary 
Elevation of Blood Pressure 


A number of researches have shown 
fluctuation of blood pressure during 
various kinds of interviews. One of the 
earliest studies of this kind in American 
research is that by O’Hare (1920) in 
which he reported that his patients re- 
acted to the discussion of relevant life 
problems with an elevation in blood 
pressure, Alexander (1939b) noticed 
that, when his patients had particularly 
disturbing sessions in analysis, their 
blood pressures were markedly elevated. 
Wolf, Pfeiffer, Ripley, Winter, and 
Wolff (1948) assert that, for their pa- 
tients, blood pressure went up when un- 
pleasant life experiences were discussed 
and went down with “nice” conversation. 
Pfeiffer and Wolff (1950) demonstrated 
rises in blood pressure when subjects 
discussed personally important topics 
with “threatening” significance. 

Perhaps the most thorough study em- 
ploying the psychiatric interview as a 
stimulus is that by Innes, Millar, and 
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Valentine (1959). The experimenters 
used simultaneous recording of the inter- 
view and physiological measurements to 
permit a matching of blood pressure 
changes with specific content areas in 
the interview. Contrary to the findings 
of other studies to be reviewed later, the 
investigators found no correlations be- 
tween specific kinds of interview content 
and blood pressure changes. Blood pres- 
sure did fluctuate according to interview 
content, but the authors were not able 
to establish a coding category which 
would fit with the blood pressure 
changes. There was, however, a signifi- 
cant relationship between the use of 
self-referent pronouns (I, my, etc.) and 
increases in blood pressure, regardless 
of the content of the sentence in which 
the pronoun appeared or the context of 
the discussion. Innes, Millar, and Valen- 
tine’s subjects were both normals and 
neurotics, and it is possible that such a 
mixture obscured differences between the 
two groups. Their failure to create a 
code which would work may be viewed 
as a criticism of much of the work using 
the interview technique. 

There are some more recent studies 
which take advantage of objective 
events posing threats to the physical or 
psychological life of people who experi- 
ence them. For example, Graham 
(1945) measured the blood pressure of 
soldiers who had seen long combat in 
the African desert campaign in World 
War II and found 279% showed high 
diastolic blood pressure (> 100 mili- 
meters) for several months afterward. 
Ruskin, Beard, and Schaffer (1948) 
found that the diastolic blood pressure 
of 56% of Texas City survivors was 
markedly elevated (>95 millimeters) 
for about 1-2 weeks after the tre- 
mendous explosion there. A rather un- 
usual experimental environment was 
created as a stimulus for parachutists by 


Grimak (1959). He hypnotized these 


men and had them relive a jump experi- 
ence. Continuous blood pressure read- 
ings were taken. Blood pressures went 
up when the plane took off, stayed high 
during the flight, went down when the 
subjects jumped and felt their chute 
open. Readings after subjects hit the 
ground were still higher than those taken 
before takeoff. 


Specific Blood Pressure Responses to 
Emotion-Producing Stimuli 


Cannon (1932) suggested that emo- 
tion was a generic term to apply to 
an emergency pattern of physiological 
changes regardless of the psychological 
components, for example, anger or fear. 
Ax (1953) attempted to explore physio- 
logical differences accompanying stimuli 
specifically designed to arouse anger 
and fear separately. The anger stimulus 
consisted of insulting behavior to the 
subject by a laboratory technician who 
alternately jostled, verbally insulted, 
and was generally nasty. 

The fear stimulus consisted of a gradual in- 
termittent shock stimulus to the little finger 
which never reached an intensity sufficient to 
cause pain. When the subject reported the 
sensation, the experimenter expressed surprise, 
checked the equipment, pressed a key which 
caused sparks to jump near the subjects, then 
exclaimed with alarm that this was dangerous 
high voltage short circuit [Ax, 1953, p- 35]. 


The same subjects were used in both 
the anger and fear situations. Fourteen 
physiological measures were taken, and 
frequency of changes, rather than ampli- 
tude, was used to measure response. The 
number of diastolic blood pressure, heart 
rate, galvanic skin response, and muscle 
tension increases was greater for anger 
than for the fear condition. Similarly 
Schachter (1957) exposed subjects to 
situations intended to produce emotions 
of pain, fear, and anger in that order. 
For the pain experience, subjects kept 
their hands in a bucket of ice water (3 
degrees centigrade for 1 minute). The 
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anger and fear situations were similar to 
those used by Ax. Subjects responded 
to these stimulus conditions with a 
significant increase in diastolic blood 
pressure to the pain and anger condi- 
tions and a significant systolic increase 
to the fear condition. Another experi- 
ment by Funkenstein, King, and Dro- 
lette (1957) used frustrating tasks to 
induce anger. Results were similar to 
Schachter’s. 

Buss (1961) compares the results of 
the above three studies in terms of blood 
pressure changes. He asserts that in 
general these experiments support the 
notion that physiological emotion ap- 
parently is not as Cannon hypothesized, 
that is, all the same emergency pattern. 
The results also indicate that for fear 
(or anxiety) stimuli, at least, there may 
be a consistent blood pressure pattern of 
response: a rise in systolic pressure (and 
also heart rate) and a smaller increase 
in diastolic pressure. The results de- 
scribing the blood pressure patterns ac- 


companying anger are less consistent in 
these three studies, 


PSYCHOLOGICAL FACTORS as 
INTERVENING VARIABLES 


In this section studies shall be re- 
viewed in which psychological variables 
are used as intervening variables in the 
research. A frequently used clinical 
technique for assessing the dependent 
variable, blood pressure reactivity, js 
the cold pressor test. The subject im- 
merses his hand (or foot) to the 
wrist in crushed ice and water at a 
temperature of 4-6 degrees centigrade 

t least 1 minute. Subjects experi- 
ee ious degrees of pain, but with no 
pa All subjects show some in- 
oes in blood pressure. While reaction 
oe utd rate of recovery may ` 
nought by many psychologists to ° 

i ith this physica 
important variables with | Koger 
pressor, most research with 


pressor test has used amount of change 
from resting level as the significant 
variable. This is called “reactivity” as 
contrasted to the absolute blood pres- 
sure level reached. 

There are several studies which seem 
to show that reactivity to the cold 
pressor test varies according to the psy- 
chological characteristics of the subjects. 
For example, White and Gildea (1937) 
observed that patients with somatic 
symptoms of anxiety (e.g., finger twitch- 
ing) showed a greater reaction to the 
cold pressor test than did patients with- 
out these symptoms. Malmo and Sha- 
gass (1952) compared the reactivity of 
various types of Psychiatric patients. 
Subjects diagnosed as neurotic showed 
greater increases of blood pressure on the 
cold pressor than did psychotics or a 
control group of normals and took longer 
to recover. More recently Harburg 
(1962) observed a significant association 
between college males who reported 
higher hostility in minor interpersonal 
conflicts and higher diastolic rise on the 
cold pressor test, 

Cranston, Chalmers, Taylor, Hen- 
schel, and Keys (1949) combined the 
cold pressor test and a stressful inter- 
view. Subjects were given six to eight 
cold pressor tests on different occasions 
to establish their usual degree of reactiv- 
ity. Subjects then experienced an inter- 
view in which they talked about topics 
which the investigators judged to be 
emotionally relevant for them. The in- 
terview is reported to have evoked = 
tility and resentment in most mam 
Immediately after the interview, the su 
jects again took the cold pressor test. 
Blood pressure levels were significantly 
higher. The process by which an emo- 
tion such as hostility or resentment 
heightens the effects of pain is not 
clearly understood. Also, it might be that 
the cold pressor was merely the final 
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stress necessary to produce an elevation 
of blood pressure. 

Other kinds of physical stress condi- 
tions are employed to produce blood 
pressure reactions. For example, Steven- 
son, Duncan, Flynn, and Wolf (1952) 
had subjects perform a standardized 
exercise, taking blood pressures before 
and after. Subjects who were rated by 
psychiatrists as “relaxed” showed less 
increase in blood pressure than those 
rated as “preoccupied” or already in 
therapy for anxiety symptoms. Thur- 
rell, Greenfield, and Roessler (1961) 
rated psychiatric patients (none of 
whom were on drug therapy) on degree 
of “psychological responsivity” based on 
a 15-minute interview. Subjects rated as 
“hyperresponsive” showed a greater 
variation of blood pressure from a base 
line following an injection of mecholyl 
than did those rated “hyporesponsive.” 


Are THERE PSYCHOLOGICAL DETER- 
MINants oF HrcH BLOOD PRESSURE? 


Much of the research linking psycho- 
logical variables and blood pressure has 
been done in an effort to discover the 
“cause” of permanent high blood pres- 
sure or hypertension. Such studies were 
begun on the hunch, and some evidence, 
that there were behavioral differences 
between persons with high blood pres- 
sure and those with low. For example, 
Alexander (1902) was among the earliest 
to note differences between the blood 
pressures of persons committed to mental 
hospitals and blood pressures of the gen- 
eral population, with the patients tend- 
ing to have higher blood pressures. In 
1919 Moschowitz drew up 4 personality 
description of hypertensives based on 
observations made on his patients. The 
hypertensive was described as: over- 
weight, sluggish, slow and ungraceful; 
tense and irritable; with narrow intel- 
lectual horizons and no hobbies; sleep- 
ing badly; having lost the ability to 


enjoy life, even though financially suc- 
cessful. Moschowitz carefully avoided 
asserting this syndrome as the cause of 
high blood pressure. Other examples of 
this sort of research are studies by 
Malmo and Shagass (1950) and Bene- 
dict (1956). 


Specific Personality Differences of High 
and Low Blood Pressure Groups 


Using a combination of self-ratings 
and ratings by two peers, Hamilton 
(1942) found that young high blood 
pressure subjects are more frequently 
described as submissive, introverted, less 
self-assertive, less self-confident, slow, 
lethargic, and getting less enjoyment 
from the opposite sex. These results 
seem similar to those found in a popula- 
tion of college males by Harburg, Julius, 
McLeod, McGinn, and Hoobler (1963) 
who report positive correlations of Cat- 
tell’s 16 PF measures of “sensitivity” 
and “submissiveness” with consistently 
high systolic blood pressure levels meas- 
ured in different situations; for example, 
on a college registration line, in the 
physician’s office, and at home taken by 
the student. Higher blood pressure 
among this population was also asso- 
ciated with yielding in an argument be- 
tween peers. 

Brower (1947a) correlated blood pres- 
sure of college students with scores on 
the Rorschach and MMPI. He found a 
significant negative correlation between 
diastolic blood pressure and FC scores 
on the Rorschach—FC is considered by 
Brower to be a measure of adjustment. 
In another study, Brower (1947b) found 
significant positive correlations between 
diastolic blood pressure (he did not 
measure systolic) and the ?, Depres- 
sion, Hypochondriasis, and pipe 
pathic Deviate scales of the MMPI. 
Cattell and Scheier (1959) report cor- 


relations between systolic blood pressure 


and some of the Cattell 16 PF factors. 
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No further description is provided of the 
procedure or the subjects. They claim a 
positive correlation between high systolic 
pressure and Extraversion, A+, Q2-, 
N+, M-, E+, having bigger bones, 
exuberance, long range goals, low con- 
fidence of skill in untried performance, 
and more tremor in right hand. 
There are a few studies which report 
no differences in scores on personality 
tests between high and low blood pres- 
sure samples. For example, Innes, Mil- 
lar, and Valentine (1959) found no 
significant differences between hyper- 
tensive pregnant females (average age 
26), normotensive females after delivery 
(age 26), psychoneurotic females (age 
32), and hypertensive females (age 53) 
in scores on the MMPI. Robinson 
(1959) found no significant relationship 
between blood pressure and performance 
on the Maudsley Personality Inventory 
and the Porteus Mazes. Storment 
(1951) compared five groups of male 
hospital patients using the Guilford- 
Martin STDCR, GAMIN, and OAgCo. 
The groups of patients were: hyperten- 
sives, rheumatic heart disease, coro- 
nary occlusion, arteriosclerosis, and non- 
cardiac infectious disease patients. There 
were no differences among the five 
groups on test scores, 


Hostility and High Blood Pressure 


One of the most energetic “schools” of 
research on hypertension is that which 
promotes the repressed hostility hy- 
pothesis. Beginning with Alexander’s 
(1939a) theoretical discussion, this 
hypothesis asserts = hypertension de- 
velops because the prehypertensive per- 
son continually represses feelings of 
hostility instead of expressing them. 
This repressed hostility or ain is 

ically expressed in increased blood 
Sair hi h, over time, leads to the 
sacl yoo changes associated 


rmanent vascular 
ie hypertension. Saul (1939) treated 
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hypertensives in analysis and concluded 
that the salient similarities among hy- 
pertensives are “the prominence in every 
case of a dominating mother... - 
marked inhibition of heterosexuality, re- 
pressed hostility and being neither weak 
and dependent nor aggressively hostile 
[p. 160].” 

A number of clinical observers have 
worked along the line of Alexander’s re- 
pressed hostility hypothesis. They usu- 
ally interpret the material obtained in 
psychotherapy sessions. It will remain 
an open question as to how representa- 
tive of the general population of hyper- 
tensives are those in psychotherapy; it 
is quite likely that they are a small and 
selected part of the population. Sample 
studies in this area, yielding results 
much like Saul’s, are those by Wolf et 
al. (1948), Gressel, Shobe, Saslow, Du- 
Bois, and Schroeder (1949), and Saul, 
Sheppard, Selby, Sachs, and Master 
(1954). 

There are other studies which raise 
problems for the idea that hypertensives 
repress their hostility. Neiberg (1957) 
gave a multiple-choice form of the 
Rosenzweig Picture-Frustration Study 
before and after a criticism experience. 
There were no differences in poststress 
aggressivity between hypertensives and 
normal subjects. Schachter (1957) ex- 
posed subjects to pain, fear, and anger 
situations in that order. Judges rated 
expression of anger and fear by the 
subjects. High blood pressure subjects 
expressed more anger and fear; there 
was a positive correlative correlation be- 
tween expression of anger and blood 
pressure. A study by Kaplan, Gotts- 
chalk, Magliocco, Rohorst, and Ross 
(1961) further suggests that hyperten- 
sives are more hostile than those with 
normal blood pressure, but that they 
express their hostility in measurable 
ways. Shapiro (1960) has reviewed re- 
search relevant for the repressed hostil- 
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ity hypothesis and concludes that avail- 
able data are inconclusive. 


The Prehypertensive Personality 


In order to successfully argue that a 
particular personality syndrome pre- 
cedes the development of hypertension 
(and might therefore be causative) it is 
necessary to demonstrate that persons 
of a given personality type are more 
likely to develop hypertension than are 
others. No such research has been done. 
Indirect evidence is provided in two 
studies (Ayman, 1933; Kalis, Harris, 
Sokolow, & Carpenter, 1957) which show 
that young subjects with high blood 
pressure are more likely than normals to 
develop hypertension (eg., Levy, White, 
Stroud, & Hillman, 1945) and are sim- 
ilar in personality to older persons with 
hypertension. 

Several other examples of research 
lend partial support to the idea that 
emotional responses, conditioned by 
early learning, may in turn mediate blood 
pressure reactivity. An experiment by 
Hokanson (1961b) provided subjects 
with a stressing social situation. In- 
lic and diastolic blood 


crease in systo 
pressures under social stress were cor- 


related with scores on several tests of 
manifest hostility. Subjects high in Test 
Hostility showed greater rises in systolic 
and diastolic blood pressure. Hokanson 
argues that subjects with high Test 
Hostility scores must feel anxious OF 
guilty about expressing hostility. He 
relates this to the findings m another 
study (1961a) in which subjects who 
released aggression showed a greater de- 
cline in plood pressure after the experi- 
ment than did subjects who did pet 
aggress. Hokanson contends that ag- 
gression anxiety” acts to inhibit aggres- 
d to early emotional 


sion and is related * : 
conditioning of the child by his parents. 


Some evidence js available to sup- 
port the argument that blood pressure 


reactivity may be genetically trans- 
mitted. For example, studies by Hines 
(1940), Doyle and Fraser (1961), and 
Shapiro (1961) demonstrate that chil- 
dren of hyperresponsive parents show 
greater increases of blood pressure to 
stress than do children of normorespon- 
sive parents. However, a respectable al- 
ternative hypothesis, which fits the data 
equally well, is that hypertensive parents 
create conditions in the home which lead 
to development of high blood pressure or 
reactivity in their children. As far as is 
known, the only research relating a sub- 
ject’s reported behavior of his parents 
and his own blood pressure reactivity 
is that by McGinn (1962; McGinn, 
Harburg, & Julius, in press). He 
found that hyperresponsive college males 
described their mothers as aggressively 
rejecting them, and fathers passively re- 
jecting, as compared with normoresponse 
subjects. These data are consistent with 
the hypothesis that stressful experiences 
in childhood can lead to physiological 
overreactivity to stress in adulthood. 
Two descriptions of the process by which 
this might occur are those by Mowrer 
and Viek (1948) and Malmo (1957). 
However, there are no studies which 
have employed direct observation of 
parental behavior rather than children’s 
or parents’ recall. 

Finally, if personality characteristics 
precede development of hypertension 
and are causally associated with it, then 
one might expect to find different per- 
sonality traits among essential hyper- 
tension patients as compared with those 
whose hypertension can be ascribed to 
some structural malfunction, such as & 
dysfunctional kidney. However, Ostfield 
and Lebovits (1958) found that essential 
hypertension patients and renal hyper- 
tension patients, matched on 8° ae, 
race, and socioeconomic status, showed 
no differences in group scores on the 
MMPI or Rorschach or in response to 
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a stressful interview. It is always a 
possibility that personality characteris- 
tics are a noncausal correlate of hyper- 
tension in all stages of its development 
and that both can be attributed to some 
other causal agent. 


Effects of Environmental Stress 


One such possible agent is environ- 
mental stress. Inadequate behavior in 
social situations might have disturbing 
consequences for some people, hence the 
relationship of personality measures with 
blood pressure. For other persons, oc- 
cupational strains or other stressing 
events in the daily environment might 
occasion frequent elevation of blood 
pressure and lead to the development 
of the illness in the Way suggested by 
Malmo (1957). 

Steiglitz (1930) was perhaps the first 
to argue that repeated episodes of emo- 
tional tension could lead to permanent 
hypertension. Palmer (1937) performed 
a study with the cold pressor test and 
concluded that hypertension is a chronic 
emergency response: 

Those who have definite essential hyper- 
tension usually have experienced severe and 
prolonged mental or nervous strain and often 
have suffered from insomnia, functional gastric 
disorder, marked exhaustion and occasionally, 
more or less severe nervous breakdown [p. 
692). 


Reiser and his colleagues (Reiser, 
Brust, & Ferris, 1951, Reiser, Brust, 
Shapiro, Baker, Ranschoff, & Ferris, 
1950; Reiser, Rosenbaum, & Ferris, 
1951) have attempted to link the oc- 
currence of life crises and the develop- 
ment of hypertension by intensive clini- 

xamination of the events which 
m ded the onset of the illness. No 
ci ress seemed a necessary condi- 
the authors feel im mere 
usually represent threats n Pe am 
self-concept, such as threa 
or failure on the job. 


: . 
©" Another way to test the environmental 


specific st 
tion, but 
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stress hypothesis is to examine blood 
pressures across cultures, predicting 
that cultures characterized by highly 
stressful conditions of life will have 
more hypertensives than those cultures 
in which life is easier and less stress- 
ful. For example, Shreenivas (1951) 
found lower blood pressures among 
natives of East India than are found 
in the United States. Maddocks (196la, 
1961b) found lower blood pressures in 
South Sea Islanders than in residents 
of London, England. However, why 
should it be expected than the over- 
crowded conditions of life in the Orient 
are less stressful than those in more ad- 
vanced economies? Further, Marcussen 
(1950) discusses several cases in which 
symptoms of high blood pressure dis- 
appeared when the subject was under an 
extreme environmental stress (e.g., be- 
ing a prisoner of war of the Japanese) 
but reappeared upon re-establishment of 
a normal pattern of life. 

The most effective test of the stress- 
produces-hypertension hypothesis would 
be to expose an organism to stress and 
see whether it develops hypertension. 
Selye (1943) ; Farris, Yeakel, and Med- 
off (1945); and Yeakel, Shenkin, Roth- 
baller, and McCann ( 1948) claim that 
tats exposed to loud noises (sounds pro- 
duced by an air blast) at frequent inter- 
vals for a long period of time develop 
hypertension, Russian researchers have 
also been able to experimentally produce 
hypertension in animals using Pavlovian 
conditioning techniques, according to a 
review of their work by Simonson and 
Brozek (1959). 

However, some researchers have not 
been able to duplicate these results, 
Shapiro and Horn (1955) used condi- 
tioning techniques in an attempt to pro- 
duce experimental neuroses in cats, 
Full-blown neurotic behavior never de- 
veloped, according to the authors. 
Neither was the blood pressure of the 
conditioned animals markedly above 
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that of controls. Shapiro and Melhado 
(1958) exposed rats to various condi- 
tioning procedures designed to evoke 
“chronic anxiety.” No hypertension was 
produced, but there was evidence that 
existing vascular disease could be seri- 
ously exacerbated by the stress proce- 
dures. The Shapiro studies used a con- 
ditioning technique rather than the 
noxious stimuli procedures of Selye and 
Farris because, as Shapiro (1960) has 
pointed out, the latter involve long pe- 
riods of extreme fatigue for the animal 
in addition to “psychic” effects, thus 
clouding the interpretation of results. 
On the other hand, there is reason to 
suspect that Shapiro’s animals never ex- 
perienced the degree of stress that the 
Selye and Farris animals did and that 
this amount of stress is necessary to 
produce permanent high blood pressure. 


Discussion 


In conclusion, it should be noted that 
most of the extant research is the prod- 
uct of members of the medical profes- 
sion. For some unclear reason psychol- 
ogists lost interest in blood pressure as 
an important physiological variable to 
be related to personality factors shortly 
before 1930, Some valuable research has 
been produced by psychologists in very 
recent years, but in small quantity. One 
of the consequences of domination of 
the field by medical researchers is that 
much of the research does not meet those 
methodological criteria which psychol- 
ogists would employ. 

The following problems are seen as 
those which are most 17 need of solu- 
tion at the present time. (a) Can per- 
manent high blood pressure be induced 
in animals by techniques which are com- 


mental neuroses? (4) What, if any, 
is the relationship between childhood ex- 
s and blood pressure characteris- 


perience: p 
tics? (c) Are certain young adult per- 
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sonality types more likely to develop 
high blood pressure than others? 
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ASSESSMENT OF RISK TAKING BEHAVIOR 
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Risk taking propensity has been hypothesized to be a general personality 
disposition and many devices have been proposed for use in its assess- 
ment. Several types of studies relevant to the validity of these measures 
are reviewed. These studies expose a considerable lack of agreement 
among what are supposed to be measures of the same general character- 
istic. This implies that the domain of risk taking behavior may not be as 
conceptually unitary as many Psychologists would like to believe. Some 
explanations for this lack of agreement are offcred in an attempt to 
provide a more adequate means of conceptualizing and assessing risk 


taking behavior, 


An increasing number of recent theo- 
retical and experimental articles have 
attempted to provide insights into the 
dynamics of risk taking (RT) and to 
relate individual differences in RT to 
other psychological variables. For ex- 
ample, RT propensities have been hy- 
pothesized to be important determiners 
of problem solving ability (Bruner, 
Goodnow, & Austin, 1956), creativity 
(McClelland, 1956a), accidents (Dun- 
lap, 1953; Suchman & Schertzer, 1960), 
vocational choice and entrepreneurship 
(McClelland, 1956b, 1961; Ziller, 
1957b), and criminality (Cohen & Han- 
sel, 1956). For the most part these 
studies have simply proposed a rela- 
tionship between behavior and RT pro- 
pensity and have been little concerned 
with the problems inherent in defining 
and assessing this presumably general 
personality characteristic. The purpose 
of this paper is to focus attention on 
the problem of developing an adequate 
conception of RT behavior upon which 
to base assessment devices for applica- 
tion in research of this nature. 


arch Institute, Eugene. 

1 Now at Oregon Researc $ 
The author wishes to thank Ward Edwards, 
Sarnoff A. Mednick, H. William Morrison, and 
Waren T. Norman for their helpful criticisms 


and suggestions. 


220 


History anv Vatipity or RT 
MEASURES 


Most dictionary definitions of risk 
center around the phrase “chance of 
loss.” Accordingly, the magnitudes of 
both probabilities and losses enter into 
the determination of the amount of risk 
which a given course of action entails. 
This section is concerned with acquaint- 
ing the reader with the rationales by 
which proposed RT measures are linked 
with the notions of probability and loss 
and with integrating three types of 
evidence relevant to the adequacy with 
which they are measuring general RT 
propensities, At this early stage of 
research, such evidence also has impli- 
cations for the adequacy of the concep- 
tion of RT as a behavioral tendency 
which will consistently exhibit itself in 
a variety of situations. 

The first type of evidence comes from 
studies attempting to correlate RT in- 
dices with other personality traits. If 
the different assessment devices are tap- 
ping a common dimension of personality 
they should have similar personality cor- 
relates. Unfortunately, only two vari- 
ables, need for achievement and mascu- 
linity-femininity, have been studied in 
enough detail to make this kind of 
analysis possible. A second type of evi- 
dence is obtained from comparison of 
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the measures with independent “risk 
relevant” behaviors such as the amount 
of risk in one’s vocation. Finally, valu- 
able information is gained by comparing 
each instrument with other measures of 
RT propensities in an attempt to es- 
tablish what Campbell and Fiske (1959) 
call convergent validity. Convergent 
validity requires different measures of 
the same trait to correlate significantly 
and substantially with one another. It 
is a necessary step whenever the trait 
under study is one which can presum- 
ably be consistently observed in dif- 
ferent situations. 

To facilitate these proposed analyses, 
RT indices have been divided, somewhat 
arbitrarily, into three categories: Te- 
sponse set and judgmental, question- 
naire, and probability and variance pref- 
erence measures. 


Response Set and Judgmental Measures 

A number of i 
concerned with wi 
on ability tests as 


nvestigators have been 
Ilingness to gamble 
reflective of general 


RT tendencies. When Cronbach (1946) 
discussed the types of response sets 
that contaminate test scores he men- 
tioned several that have been of par- 
ticular interest to researchers seeking 


objective measures of RT. These are 
the tendency to guess on a test item 
when uncertain about the answer rather 
than omit it (Cronbach labeled this the 
gambling responses set) ; the tendency 
to include many items in a given cate- 
gory when n t is imposed (the 
inclusiveness set); _the age’ to 
answer rapidly on time tests, og a 
ing to compensate for committing & 
greater number of errors by completing 
more items (th q versus accuracy 


set). 
Gambling Set. The a) to gam- 
-g on. tests that Impose a 
bie BS il as noted early by 


alty for mistakes W: 
peniaw (1936). Swineford (1938, 1941) 


o limi 


e spee 


had subjects indicate the number of 
points that they wanted each question 
on an achievement examination to be 
worth. Possible values ranged from one 
to four points and in the event the item 
was answered incorrectly, the indicated 
number of points was subtracted from 
the subject’s score. Swineford’s gam- 
bling index, the number of errors on 
questions valued at four points divided 
by the total number of errors and omis- 
sions, was found to be independent of 
ability and higher for boys than for girls. 
This latter finding is an expected one in 
our culture and is obtained with several 
other proposed measures of RT. Coombs, 
Milholland, and Womer (1956) and 
Danielson (1957) have also hypothe- 
sized a relationship between test gam- 
bling and RT, the latter investigator 
concluding that gambling proneness is a 
personality variable whose expression 
can be modified by conditions such as 
ambiguity or importance of. the test. 
Ziller (1957a, 1957b) constructed an 
index of test gambling and related it to 
preferences for risky occupations. Per- 
sons aspiring to sales occupations gam- 
pled most often while future civil engi- 
neers took fewest risks. 

Inclusiveness Set. The inclusiveness 
set was first viewed as reflecting a per- 
sonality trait by Kliiver (1936). More 
recently, Gardner (1953) noted that 
persons vary greatly in the realm of ob- 
jects that they are willing to subsume 
under the same rubric. He found that 
individual differences in this equivalence 
range were consistent over a variety of 
perceptual and nonperceptual tasks. 
This concurs with findings by Marrs 
(1955). 

Pettigrew (1958) studied a variable 
called category width that is quite 
similar to Gardner’s notion of equiv- 
alence range and to the inclusiveness set. 
Pettigrew found that when subjects were 
asked to estimate maximum and mini- 
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mum values for such items as the length 
of whales or the annual rainfall in 
Washington, D. C. they exhibited a sig- 
nificant consistency in their category 
ranges. Pettigrew suggested that cate- 
gory width may be tapping an RT 
dimension. For example, broad cate- 
gorizers seem to have a tolerance for 
Type I errors: They risk negative 
instances in an effort to include a maxi- 
mum of positive instances, By contrast, 
narrow categorizers are willing to make 
Type II errors: They exclude many 
positive instances by restricting their 
category ranges in order to minimize the 
number of negative instances. 
Bruner and Tajfel (1961) proposed 
a similar conception of category width 
in RT terms. However, unless differen- 
tial costs are explicitly associated with 
the two types of error, it is not possible 
to Jabel broad categorizing as neces- 
sarily riskier than narrow categorizing 
or vice versa. Therefore, it is surprising 
that most research has attempted to link 
broad categorizing with high RT pro- 
pensity. Wallach and Kogan (1959) 
and Wallach and Caron (1959) found 
that males were broader categorizers 
than females, Wallach and Caron also 
found that boys were more willing than 
girls to call ambiguous figures similar to 
a standard figure in a situation where 
the likelihood of error was greater when 
these figures were judged similar. These 
similarity judgments were found to be 
related to category width, Rosen 
(1961) and Rosen and Hoffman (1961) 
found that males had greater category 
widths than females and that subjects 
reduced their category widths under con- 
ditions designed to promote avoidance of 
risk. 

Speed versus Accuracy. Persons who 
sacrifice accuracy for speed and who ate 
willing to gamble on an examination 
exhibit a readiness to risk making Type 
Į errors similar to that of the broad cate- 
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gorizer. Johnson (1954) and Block and 
Petersen (1955) found speed versus 
accuracy scores to be quite stable across 
different tasks. Winder and Wurtz 
(1954) studied personality differences 
among fast and slow deciders and found 
that fast deciders scored higher on the 
Masculinity scale of the MMPI. Fillen- 
baum (1959) provided evidence indicat- 
ing moderate generality of category 
width but failed in an attempt to relate 
category width variables and decision 
speed. Only 4 out of 10 correlations 
among different category width variables 
were significant, the highest of those 
being .40. None of the category width 
measures correlated with decision time 
as measured by a task in which subjects 
were asked to estimate the most common 
characteristic of a deck of cards after 
observing as few cards as possible. 
Judgmental Measures. Other potential 
RT indices include measures of ex- 
tremity of judgment in situations where 
greater extremity affords the possibility 
of a greater magnitude of error and judg- 
mental confidence or certainty, which 
might be indicative of an individual’s 
characteristic biases in perceiving prob- 
abilities of success and failure. Brim 
and Hoff (1957) investigated the Desire 
for Certainty Test in which subjects 
were asked to complete sentences of the 
form “The chances that such-and-such 
an event will occur are about in 
100.” After making his probability esti- 
mate the subject rated his confidence 
in that estimate. Scores obtained were 
the mean confldence rating and mean 
deviation from the most conservative 
probability estimate (which is 50%). 
Wallach and Kogan (1959) studied sex 
differences on the Desire for Certainty 
Test. They found that for items on 
which persons were anything less than 
quite sure, men’s probability estimates 
were more extreme than those of women. 
For items on which subjects were very 
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sure, women’s probability estimates were 
more extreme. Women also had less 
confidence in their probability estimates 
and narrower category widths. They 
concluded that women were highly 
certain less frequently than men, but 
when they were very certain they were 
more willing to take large risks. Sim- 
ilarly, Winder and Wurtz (1954) found 
that highly confident individuals had 
higher masculinity scores on the MMPI 
than did less confident persons and were 
more likely to- describe themselves as 
adventurous on an adjective check list. 
Kogan and Wallach (1960) attempted 
to validate the interpretation of judg- 
ment certainty as an RT concept by re- 
lating subjects’ confidence in their prob- 
ability estimates on the Desire for 
Certainty Test to their semantic dif- 
ferential evaluation of concepts assumed 
to be relevant to general risk, monentary 
risk, or physical risk. Persons less cer- 
tain of their judgments tended to con- 
ceive of risk taking terms as more 
hostile, cold, and tense than did more 
confident individuals. 


Questionnaire Measures 


Torrance and Ziller (1957) con- 
structed an inventory to assess RT pro- 
pensities from a knowledge of life ex- 
periences. Their item pool consisted of 
activities hypothesized to be related to 
the development of RT tendencies. 
These items covered early childhood be- 
haviors, sports, reactions to competition, 
financial risks, risks in interpersonal rela- 
tionships, etc. They validated their Risk 
Scale against a questionnaire which re- 
subjects to chons amona SA 

ing varying degrees o k 
a dtn ons: The indices obtained 
from this questionnaire were the number 
of items on which the subject chose the 
riskiest alternative, the mean probability 
of success assigned to the riskiest acts, 
and the minimum probability of success 


quired 


that the subject said he would require 
before undertaking the riskiest acts. Cor- 
relations between the Risk Scale and 
these variables ranged from .22 to .34 
in the predicted direction. However, 
when interpreting these results one must 
note that these questionnaire indices 
shared a common self-report response 
modality and variance due to this com- 
mon method may have inflated the cor- 
relations. Correlations between these 
measures and Ziller’s (1957a) index of 
the gambling set, a measure based on a 
different response modality, were some- 
what lower. 

Williams (1960) built a Guttman 
scale called the Job Preference Inventory 
with which to measure tendencies to take 
calculated risks in an industrial setting. 
People classified as high risk takers were 
those who stated preferences for jobs in 
which they: are almost always on their 
own, have to make many decisions by 
themselves, could either be highly suc- 
cessful or complete failures, etc. Wil- 
liams found that workers classified as 
high risk takers placed more emphasis 
on promotions and upon merit as the 
criteria for mobility than did low risk 
takers. He also discovered that individ- 
uals whose jobs provided less challenge 
in terms of risk than they were willing 
to accept were more likely to be dis- 
satisfied with their jobs than were work- 
ers whose jobs entailed a satisfactory 
degree of risk. 


Probability and Variance Preference 
Measures 


Edwards (1953, 1954a, 1954b, 1954d) 
pioneered in the study of preferences 
among gambles differing in probability 
of winning and losing. He observed that 
subjects had definite preferences for bet- 
ting at some probability levels rather 
than at others. 

Coombs and Pruitt (1960) noted that 
a bet with probability p to win A dollars 
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and probability g to win B dollars can 
be viewed as a probability distribution 
over outcomes of money whose mean or 
expected value is (A) + g(B), whose 
variance is pg(A—B)?, and whose skew- 
ness is 1-2p/V pg. For bets with two 
outcomes, skewness is confounded with 
the probabilities of winning and losing. 
Coombs and Pruitt argued that in ad- 
dition to probability preferences, vari- 
ance preferences are indicative of an 
individual’s utility for risk. The vari- 
ance of a bet reflects the amount of 
deviation of its possible outcomes from 
the average amount of money to be ob- 
tained by playing the bet. For example, 
a bet offering even odds to win or lose 
$5.00 has a larger variance than a bet 
offering even odds to win or lose $1.00, 
although both have an expected value of 
zero dollars. 

Coombs and Pruitt (1960) had sub- 
jects choose among pairs of zero ex- 
pected value bets differing in probability 
of winning but with variance held con- 
stant and vice versa. They found sizable 
individual differences in probability and 
variance preferences. Variance prefer- 
ences were less stable than probability 
preferences and in general, subjects pre- 
ferred greater variance for bets which 
contained their favorite probabilities. 
The writer (unpublished) replicated the 
Coombs and Pruitt study and also found 
subjects who exhibited this shift in vari- 
ance preference. However, other sub- 
jects had variance preferences which 
were stable at different probability levels 
and probability preferences which 
shifted systematically as the variance 
of the bets changed. 

It should be noted that for bets with 
tcomes and zero expected value, 
those used by Coombs and Pruitt, 
high probabilities of 
winning carry with them the nari a 
of a large loss. Similarly, though ‘ong 
shot bets afford greater opportunities 


two ou 
such as 
alternatives with 


to lose, the amount of money that may 
be lost is relatively small. Therefore, 
there is no unequivocal .hypothesis con- 
cerning the relation between probability 
preferences among these bets and high 
RT. The magnitude of risk which per- 
sons perceive in these bets is an open 


empirical question. - 
Additional studies, with the exception 
of Edwards (1954d) and Royden, 


Suppes, and Walsh (1959), support the 
notion that variance preferences exist 
and are consistent for a given individual 
within a particular RT task. Edwards 
found that variance preferences were 
considerably less important than 
probability preferences in determining 
choices among bets. Royden et al. at- 
tempted to measure subjects’ utility for 
gambling by determining their prefer- 
ences between a gamble in which they 
could win either a or c cents, the subjec- 
tive probability of each of these out- 
comes being .5, and an option which 
gave them an intermediate value, 0 
cents, for sure. Subjects who were indif- 
ferent between the two options when 
the b value was below the expected value 
of the gamble (in this case the mean of 
outcomes a and c) were presumed to dis- 
like gambling. The authors reported 
that a simple dichotomization of persons 
into those with positive and those with 
negative utility for gambling was not 
possible although there was some regu- 
larity to their behaviors. In the absence 
of more detailed presentation of results 
it is difficult to evaluate this conclusion. 
Davidson and Marschak (1959), using 
a procedure quite similar to that of 
Royden et al., found substantial vari- 
ance preferences among subjects. In 
many instances persons chose a bet with 
relatively low expected value but with 
variance more suited to their tastes. 
Lichtenstein (1962), using bets with 
three outcomes, also found marked vari- 
ance preferences, 


ry 
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Myers and associates (Myers & Fort, 
1961; Myers, & Katz, 1962; Myers & 
Sadler, 1960; Suydam & Myers, 1962) 
have investigated the effects of variance 
and other payoff variables on the 
decision to take risks. In these experi- 
ments, subjects were asked to choose be- 
tween a gamble and a sure thing. The 
sure thing was sometimes a win and 
sometimes a loss. They found that as 
the variance of the gamble increased 
subjects became more conservative, tend- 
ing to avoid the larger potential losses 
more than they approached the equally 
larger winning outcomes. As expected, 
subjects gambled more when the alterna- 
tive to gambling was a certain loss than 
when it was a sure gain. Unfortunately, 
no data on individual differences were 
reported, 

Expectancy theorists, especially Atkin- 
fon (1957), have stimulated much work 
related to the study of probability pref- 
erences. The impetus for Atkinson’s 
risk taking model came from the rela- 
tionship that McClelland (1956b, 1958) 
found between need for achievement and 
preference for moderate probabilities of 
success in ring toss, level of aspiration, 
and vocation choice. The model involves 
six variables: the subjective probability 
(i.e., expectancy) of success (Ps), the 
subjective probability of failure (P;), 
the incentive value of success (J,), the 
(negative) incentive value of failure 
(I;), the achievement motive (Me), and 
the motive to avoid failure (Mz). Incen- 
tive values and subjective probabilities 
are assumed to be inversely and linearly 
related. The variables are combined 
multiplicatively in the following equa- 
tion: Resultant Motivation=(M,xP. 
x Ig) +My x Py x —I;). The Resultant 
Motivation function has a maximum at 
P.=.5 if M, is greater than My, and a 
minimum at P,=.5 if Mọ is greater than 
Ms. Thus Atkinson predicts that individ- 
uals in whom M, is greater than My will 


225 


prefer tasks with intermediate P, while 
persons dominated by M; will prefer 
tasks in which P, is extremely high or 
low. 

Atkinson, Bastian, Earl, and Litwin 
(1960) found support for the model 
from the fact that subjects with high 
M, preferred to shoot from moderately 
difficult distances in a shuffleboard game 
while high M; persons preferred the ex- 
tremely easy or difficult distances. They 
also obtained probability preferences 
among bets in a make-believe gambling 
situation. Subjects were asked to make 
choices such as “Would you prefer odds 
of 1/6 to win $1,800 or 4/6 to win 
$450?” Expected value was held con- 
stant but variance decreased monotoni- 
cally as probability of winning increased. 
Subjects with low M, preferred extreme 
probabilities but the preferences of sub- 
jects with high M, were evenly distrib- 
uted across all probabilities. There was 
a small positive relationship between a 
subject’s probability preferences in the 
skill and chance situations. Meyer, 
Walker, and Litwin (1961) found no 
relationship between achievement moti- 
vation and probability preferences deter- 
mined from the previously described 
make-believe bets used by Atkinson et al. 
Littig (1959) studied preferences among 
bets in which variance was held constant 
and obtained results similar to those of 
Atkinson et al. Persons with high My 
preferred extreme probabilities but Ss 
with high M, did not prefer moderate 
probabilities to extreme ones. Littig 
attempts to explain this latter finding by 
suggesting that a chance situation does 
not arouse achievement aspirations in 
subjects with high M,. Scodel, Minas, 
and Ratoosh (1959) related probability 
preferences in a real gambling situation 
to achievement motivation and other 
selected personality variables. Variance 
was not controlled. Subjects preferring 
intermediate probabilities of success and 
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intermediate payoffs scored highest on 
M,, supporting Atkinson’s theory. In 
short, studies of RT in achievement- 
oriented skill situations have generally 
supported the Atkinson model while 
probability preferences in strictly chance 
settings have led to inconsistent results. 
Several possible explanations for this 
inconsistency will be offered later. 
Both McClelland and Atkinson used 
level of aspiration tasks as indicators of 
RT propensities. This line of thought 
stems from hypotheses presented by 
Frank (1935) and by Lewin, Dembo, 
Festinger, and Sears (1944), who be- 
lieved fear of failure to be an important 
determiner of such behavior. Moss 
(1961) defined cautiousness as a general 
tendency to avoid potential failure or 
disapproval experiences even though this 
goal might be attained at the expense of 
other satisfactions. He found that per- 
sons who were cautious on a level of 
aspiration task also responded cautiously 
to projective personality measures. That 
is, they produced descriptive responses 
to TAT cards and an object sorting task. 
Such responses afford minimal risk of 
invoking criticism or personal exposure 
in contrast to highly idiosyncratic or 
imaginative protocols. 
Another type of measure, closely 
related to probability preference and 
cautiousness notions, attempts to deter- 
mine the amount of information which 
an individual characteristically requires 
before committing himself to a major 
decision based on his knowledge. The 
probability of making a correct decision 
increases monotonically with amount of 
relevant information acquired. Studies 
by Irwin and Smith (1957), Messick 
and Hills (1960), Pruitt (1961), and 
Edwards and Slovic (1962) found stable 
individual differences 1n information 
seeking behavior within a specific type 
of task. Some subjects consistently 


bought too much or too little informa- 
g 


tion relative to the optimal amount 
determined by the cost of information 
and the reward for correct decisions. 
However, Wolff (1955) failed to find 
any consistency in behavior across 
several different information seeking 
tasks in which costs and payoffs were 
not explicitly controlled. 

Wallach and Kogan (1959, 1961) de- 
veloped a questionnaire to obtain prob- 
ability preferences in everyday life situa- 
tions. Labeled the Dilemmas of Choice 
Questionnaire in one study and Disutility 
of Failure Index in another, this meas- 
ure was studied in terms of sex and age 
differences. On this test a subject is 
presented with 12 hypothetical situa- 
tions, each requiring a choice between 
a safe alternative and a more attractive 
but risky one. His task is to indicate 
the probability of success which would 
be sufficient for him to select the risky 
alternative. Men were found to require 
lower probabilities of success than women 
for items involving risks of death, 
income loss, and a football defeat, while 
women were less conservative than men 
when the risks involved were those of 
an uncertain artistic career and a 
potentially unhappy marriage. Older per- 
sons of both sexes required higher prob- 
ability of success before saying they 
would undertake the risky act. The 
internal consistency of required prob- 
ability of success across the 12 situations 
was higher for older persons, suggesting 
that preferred probabilities of success 
become more generalized with age. 


Wallach and Kogan (1961) found - 


that the Disutility of Failure Index was 
significantly related to a person’s sub- 
jective probability of failure in a task 
requiring him to guide a toy car between 
two posts set at varying distances apart. 
They concluded that the presence of 
such a relationship provides a warrant 
for thinking in terms of greater or less 
caution as a dimension of personality. 


i, 
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They also attempted, without success, 
to relate the Disutility of Failure Index 
to measures of judgmental RT such as 
confidence and extremity scores on an 
adaptation of Brim’s Desire for Cer- 
tainty Test. This finding led them to 
state, contrary to their conclusion in the 
Kogan and Wallach (1960) study, that 
until further careful empirical work is 
carried out in these areas, attempts to 
conceptualize various judgmental and 
cognitive processes in decision and risk 
terms must be considered premature. 

An extensive validational study en- 
compassing several different categories of 
risk taking measures was undertaken by 
Slovic (1962). Slovic gave subjects tests 
measuring category width, speed versus 
accuracy and gambling response sets, 
William’s Job Preference Inventory, 
Torrance and Ziller’s Risk Scale, a 
vocabulary test in which subjects 
adjusted the number of points they were 
willing to risk winning or losing on each 
item thus controlling the variance of the 
situation, and a test designed to assess 
probability and variance preferences. 

In addition, subjects were asked to 
rate their fellow fraternity brothers on 
a bipolar trait of general willingness to 
take risks. One pole was defined “Loves 
to take risks. A dare-devil.” The other 
pole was labeled “Cautious. Does not 
like to take chances. Avoids risky situa- 
tions.” Twenty-eight correlations were 
obtained among measures presumed to 
be positively related. Of these correla- 
tions, only five were significant, forcing 
the conclusion that either very few of 
the variables measure general RT pro- 
pensities or such propensities do not 
exist. 


DISCUSSION 


It is not easy to summarize the many 
studies concerned with the validity of 
instruments designed to assess RT. 
There are enough positive results to 


encourage the investigator to further 
pursue the notion of RT propensity 
as a general disposition and one which 
may provide insights into a variety of 
behaviors. For example, sex differences 
or scores on tests of masculinity-femi- 
ninity generally correlate in the expected 
direction with the different indices of 
RT. Also, every type of measure is 
favorably related to some sort of inde- 
pendent behavioral criterion of RT in at 
least one study. Finally, behavior has 
been shown to be fairly consistent across 
measures which involve similar modes 
of response and are related to RT by a 
similar rationale. 

However, a large amount of evidence 
bearing on the convergent validity of 
these methods is negative. The lack of 
communality across measures involving 
different response modalities and ration- 
ales suggests that future research on 
promising hypotheses, if undertaken 
with this heterogeneous collection of as- 
sessment devices, will most likely pro- 
duce inconsistent results. What is 
needed, therefore, is a systematic investi- 
gation of the factors responsible for 
this lack of convergent validity. Some 
ideas bearing on this matter will now be 
discussed. 

Multidimensionality of Risk. One ex- 
planation for the lack of consistency 
among many RT measures is that risk is 
a multidimensional concept and most of 
the presumably “risk relevant” measures 
have been tapping these dimensions dif- 
ferentially. As mentioned earlier, the 
usual dictionary definitions of risk center 
around the expression “chance of loss.” 
Therefore, both probabilities and magni- 
tudes of loss enter into the determina- 
tion of the amount of risk in a situation, 
Coombs and Pruitt (1960) took both of 
these factors into consideration when 
they conceptualized risk in terms of 
skewness (probability) and variance. 
They studied choices among bets in 
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which these two components were iso- 
lated and controlled and found, as did 
the writer (unpublished), a marked 
interaction between preferences and the 
position of stimuli on these dimensions. 
For example, a person’s variance prefer- 
ences often shifted drastically when the 
probability level of the bets was changed. 
This implies that it may be misleading 
to use variance preferences as an index 
of RT propensities unless one also con- 
siders the probabilities of the stimuli 
involved. Furthermore, the Coombs and 
Pruitt study used only bets that had 
zero expected value. One might argue 
that a third component of the prob- 
ability distribution which might charac- 
terize the degree of risk in a situation 
is expected value, since it describes the 
mean desirability of the outcomes. No 
one has fully explored preferences among 
gambles in which expected value, vari- 
ance, and probability have all been sys- 
tematically manipulated, but it would 
seem likely that a complete description 
of person’s RT propensities would 
require consideration of his unique pat- 
tern of preferences in such situations. 
Subjectivity of Risk. Observation of 
choices made among acts entailing vari- 
ous degrees of objective risk cannot dis- 
tinguish the extent to which RT is 
determined by individual differences in 
perception of risk or by differences in 
reaction to that perceived risk, Per- 
ceptual differences might take the form 
of discrepancies among persons’ subjec- 
tive probabilities and subjective values 
(utilities) and differences in the manner 
that perceived values on these com- 
ponent dimensions are weighted and 
combined to produce a resultant evalua- 
tion of risk. As if things are not already 
complicated enough, other petits 
the probability distribution generate y 
a bet or a real life course of action most 
likely serve as important cues for the 


perception of risk. One of these is the 


magnitude of the largest possible loss. 
Another is the proportion of potential 
outcomes that have negative utility. 

The importance of subjective factors 
in RT is illustrated by familiar circus 
acts in which tightrope walkers and lion 
tamers perform their duties with a non- 
chalance arising from a realistic knowl- 
edge of the probabilities of injury, amidst 
excited crowds who, projecting their own 
subjective probabilities of failure, feel 
that the performers are taking great 
risks. The subjective aspect of RT 
might well explain the lack of “risk 
relevance” shown by peer ratings and by 
response set and judgmental measures 
for tasks in which subjective prob- 
abilities and utilities vary widely be- 
tween individuals. Researchers may 
thus be forced to scale subjective prob- 
abilities and utilities in order to allow 
any orderliness inherent in RT behavior 
maximum opportunity to emerge. For 
a review of problems and methods of 
assessing these variables see Edwards 
(1954c, 1961a, 1962). 

Emotional Arousal and Risk. Some 
interesting insights into the theoretical 
nature of RT and its assessment prob- 
lems may be obtained by examining the 
relationship between RT and autonomic 
stimulation such as that aroused by the 
emotions of fear and hope. One cur- 
rently popular school of thought proposes 
that the individual is constantly seeking 
some optimal level of internal excite- 
ment. Risk is courted in order to raise 
the amount of excitation when it drops 
below the optimal level and avoided 
when the excitation level becomes exces- 
sive (Berlyne, 1960; Leuba, 1955). 

Cohen and Hansel (1956) provoca- 
tively suggested that risky behaviors are 
usually entered upon, not with conscious 
or calculated notions of probabilities, but 
rather with a sense of danger which may 
be interpreted as fear. Dunlap et al. 
(1953), studying risk taking among au- 
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tomobile drivers, hypothesized that the 
probability that a driver will undertake 
a risky act is inversely related to his esti- 
mate of the risk involved. Furthermore, 
they proposed, this estimate of risk 
should be directly proportional to the 
driver’s level of fear. These notions are 
compatible with those of Mowrer (1960) 
whose revised learning theory sees all 
behavior as being mediated by condi- 
tioned emotions, especially the emotions 
of hope and fear. Specifically, Mowrer 
argues that hopes and fears are the 
dynamic conditioners of choice and 
imply not only probabilities but also the 
nature and magnitudes of risks. 

These hypotheses suggest that the 
study of RT might be fruitfully pursued 
by investigating each individual’s charac- 
teristic autonomic responsiveness in 
order to determine the manner in which 
subjective probabilities of success and 
failure interact with outcome utility to 
produce the sensation of fear. A next 
step would be to analyze the relation- 
ship between this fear and perceived risk 
and decisions. Mednick (1958) discusses 
several individual difference factors 
which influence the fear response and 
may thus be relevant to RT. These are 
differences in characteristic anxiety 
level, reactivity to threat, and rate of 
recovery from fear arousal. 

Most important for the present discus- 
sion are the implications of these emo- 
tional arousal hypotheses for the evalua- 
tion of RT measures. Specifically, it 
seems likely that emotional arousal is 
a prerequisite for excitation of RT pro- 
pensities and that many negative find- 
ings may be due to failure on the part 
of assessment devices to provide the sub- 
jects with a true atmosphere of risk. 

Risk taking situations may be 
dichotomized into those whose outcomes 
are contingent upon skill, such as ring 
toss, shuffleboard, etc., and those which 
are controlled only by chance factors 


as in the majority of bet preference ex- 
periments. Skill tasks, structured to be 
achievement oriented, are quite likely 
to promote ego involvement and emo- 
tional arousal. Arousal can be promoted 
in chance situations either by the percep- 
tion of the situation as one which re- 
quires skill or by the introduction of 
actual monetary rewards and losses as 
payoffs contingent upon the conse- 
quences of the subjects’ decisions. 

Several personality variables may 
influence the degree to which persons 
will become ego involved in a make- 
believe, chance oriented situation. It 
would be quite surprising not to fnd 
the need for achievement-fear of failure 
syndrome among them, Littig (1959) 
hypothesized that persons with high My 
are more likely to become ego involved 
in a game of chance than high M, indi- 
viduals. This, if true, might explain why 
predictions of probability preferences 
based on Atkinson’s model have been 
most accurate for high My persons in 
this type of experimental setting. An- 
other variable which might influence ego 
involvement in chance situations is one’s 
attitude toward fate (Liverant & Scodel, 
1960). Persons characterized by a belief 
in their ability to order their own destiny 
should be expected to become more 
involved in a task where success Or 
failure is contingent only upon chance 
than individuals who believe their fate 
is not under their own control. 

Several recent studies illustrate the 
importance of arousing RT tendencies 
by forcing the individual to commit 
himself to a choice among alternatives 
whose positive and negative incentives 
have real meaning for him. Feather 
(1959) found that subjects in chance 
oriented situations expressed preferences 
for less probable goals but actually chose 
more probable goals when the chowa 
involved commitment to action. Slovic, 
Lichtenstein, and Edwards (in press) 
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compared choices among bets under 
make-believe and real gambling condi- 
tions and found that subjects became 
bored in the make-believe sessions and 
attempted to ease the burden of making 
tedious and difficult choices by adopting 
strategies which enabled them to choose 
among bets quickly and with a minimum 
of thought and effort. Shulman (1961) 
constructed a questionnaire for the pur- 
pose of assessing willingness to take risks 
in many areas of everyday life. He at- 
tempted to validate this test against 
Swineford’s (1938) previously described 
index of gambling on achievement ex- 
aminations. Two distinct populations 
of subjects were used. The first con- 
sisted of high school students who had 
been pupils of the experimenter and 
were well acquainted with him. The 
second sample consisted of junior col- 
lege students who had no previous ac- 
quaintance with the experimenter. The 
correlation between scores on the two 
independent measures of RT was .81 
for the high school sample and .16 
for the junior college group. Shulman 
attempted to explain this discrepancy 
by hypothesizing that an atmosphere of 
risk in the achievement test situation 
was aroused only for the high school 
students whose reputation with the ex- 
perimenter was at stake. Finally, Raw- 
son (1961) has shown that self-report 
statements concerning the morality of 
activities judged to be wrong but risked 
for exploitative gain were significantly 
correlated with actual RT behavior in 
exploitative situations only when risk 
was aroused. 

It seems evident that lack of adequate 
risk arousal results in behavior which is 
unrepresentative of the responses which 
the individual would make under more 
motivating conditions. Such ee 
is likely to be responsible np — 
the lack of convergence found among 


RT measures. 


Eliminating motivational biases by 
introducing real rewards and losses as 
consequences of decisions creates an- 
other possible source of variance for RT 
behavior, namely, that due to sequential 
reinforcement effects. Optimal stimula- 
tion theorists such as Berlyne (1960) 
predict that as consequences of previous 
decisions modify the relationship be- 
tween one’s present arousal level and his 
optimal state, RT propensities will fluc- 
tuate accordingly, thus producing pheno- 
typically inconsistent behavior, The 
precise nature of effects of reinforce- 
ment upon RT is complex and needs 
considerable study. Effects of sequence 
on subjective probabilities are already 
quite well known under the heading of 
“gamblers’ fallacy” or “negative recency 
effect” (Cohen & Hansel, 1956; 
Edwards, 1961b). Myers and Fort 
(1961) provided evidence that the 
decision to gamble at any particular 
moment is influenced by the pattern of 
previous reinforcements, but surpris- 
ingly, Slovic, Lichtenstein, and Edwards 
(in press) and Lichtenstein (1962) 
found betting behavior to be quite un- 
affected by past payoff history. 

Most of the literature discussed in this 
paper ignores sequential effects. This 
literature belongs to the realm of what 
Edwards (1961a) has labeled “static” 
decision theory. It assumes that the be- 
havior of interest is sufficiently stable 
to be of the same relative strength 
throughout the test. Since any choice 
must be imbedded in a sequence of 
choices, a static model can be only a first 
approximation at best. Nevertheless, 
static models are much simpler to apply 
than their dynamic counterparts and 
they have been shown to work to some 
extent. Therefore, present investigators 
of RT behavior should exploit the po- 
tentialities of these models while at the 
same time keeping a watchful eye upon 
relevant developments in the fields of 
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motivation, learning, and dynamic deci- 
sion processes. 

Conclusion. Risk taking behavior ap- 
pears to be multidimensional in nature. 
It has substantial subjective components 
and is susceptible to a variety of 
motivational and other influences. Re- 
search attempting to establish the va- 
lidity and consistency of RT measures 
has generally neglected these factors. It 
is, therefore, not surprising that so many 
contradictory results have been found. 
It seems quite probable that investiga- 
tions taking these variables into con- 
sideration, both in the analysis of the 
risks involved in criterion situations and 
in the design of assessment devices, 
will discover enough orderliness in RT 
behavior to successfully undertake the 
various types of research mentioned in 
the introduction to this paper. 
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Extremely low doses of ionizing radiation operate as complex stimuli 
with both cue and aversive properties. Recent studies have demon- 
strated that effects can be obtained with total doses of less than 
1 roentgen. This fact may explain some of the controversies concerning 
radiosensitivity of mammals and the conflicting results obtained in be- 
havioral and electroencephalographic tests for effects of radiation exposure 


recently reviewed in this journal. 


Furchtgott (1963) has performed an 
excellent service with his recent com- 
prehensive review of the radiation liter- 
ature. However, the implications of the 
fact that ionizing radiation is a stimu- 
lus were not amply discussed. This fact 
poses some difficult problems for inves- 
tigators of behavioral effects and sheds 
light upon some of the controversial 
reports emanating from Russian labora- 
tories concerning the effects of extremely 
low doses. A total dose of 10 roent- 
gens delivered over 4 hours has been 
employed as an unconditioned stimulus 
to establish avoidance responses in rats 
(Garcia, Kimeldorf, & Hunt, 1961). 
In the first place, this means that the 
adult animal is more radiosensitive than 
had been previously believed on the 
basis of physiological as well as histo- 
logical examinations. Whether or not 
these effects are reversible is another 
question. Furthermore, it means that 
any stimulus associated with radiation 
exposure, including handling by the 
experimenter, may acquire aversive prop- 
erties and, hence, through generaliza- 
tion, may influence subsequent compari- 
sons of irradiated and nonirradiated 
animals (Garcia, Kimeldorf, & Hunt, 
1956). In some cases Ea 
effects, which are attributed aia E 
the radiation insult, may be ac a i 
conditioned responses to such stimuli 
by an animal which has completely re- 
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covered. Conditioned aversions may 
endure many weeks after a single expo- 
sure (Garcia, Kimeldorf, & Koelling, 
1955). The contradictory results, in- 
creased variability and fluctuating 
changes in performance, which charac- 
terize behavioral data, may be in part 
due to failure to control this factor. In 
the same way, physiological measures 
on unanesthetized intact animals can 
also be complicated, especially those 
measures which reflect the general adap- 
tation syndrome described by Selye 
(1956). 

There is no reason to suppose that 
radiation is an “unconscious” stimulus 
or “a UCS of which the animal is not 
aware.” Such terms are not only con- 
fusing but also misleading. The immedi- 
ate arousal properties of radiation have 
been demonstrated in invertebrates; 
furthermore, immediate visual effects 
in the dark-adapted eye are well estab- 
lished, as Furchtgott points out. Hence 
it should not be surprising to find im- 
mediate effects upon the electroen- 
cephalogram (EEG) or an inhibitory 
effect upon conditioned responses under 
certain conditions. Some of the 
“depressions” of cortical EEG activity 
reported by Russian investigators re- 
semble the pattern of low voltage, fast 
activity which accompanies arousal 
(Livanov, 1962; Tsypin & Grigor’ev, 
1960). 
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In our own laboratory, we have ob- 
served the arousal properties of X ray 
in mammals and also have demonstrated 
its use as a conditioned stimulus (CS). 
Unfortunately these most recent data 
were not yet available to Furchtgott. 
In sleepy animals X rays produce im- 
mediate EEG “desynchronization,” that 
is, the change from high voltage slow 
waves to low voltage fast waves, as well 
as behavioral arousal (Garcia, Buch- 
wald, Bach-y-Rita, Feder, & Koelling, 
1963). Furthermore, X rays produce 
an orienting response in active animals 
which will momentarily interrupt bar 
pressing in a Skinner box. X rays have 
been employed as a warning stimulus 
(CS) to signal a subsequent shock to 
the animal’s paws. After a few trials 
the animals attempt to escape from the 
compartment at the onset of radiation 
indicating that the defensive responses 
to shock have been conditioned to 
X ray. In subsequent tests, X rays were 
employed to inhibit bar pressing with 
a variation of the Skinner-Estes condi- 
tioned suppression technique (Garcia, 
Buchwald, Feder, & Koelling, 1962). 

Radiation is a complex stimulus pro- 
ducing its effects through many differ- 
ent mechanisms. Blind animals respond 
to X rays, indicating that the retina 
which is extremely radiosensitive is not 
necessary for arousal nor avoidance con- 
ditioning. Furthermore, the arousal and 
cue properties differ markedly from the 
aversive properties in latency and thresh- 
old, indicating that different mecha- 
nisms may be responsible. Avoidance 
has been established with intensities as 
low as .0007 roentgen per second when 
the exposure was maintained for hours. 
While .1-.2 roentgen per second will 
cause arousal within 1 second, the ani- 
mals do not attempt to escape immedi- 


ately; on the contrary they apparently 
habituate readily to brief exposures. 

These apparent discrepancies are 
probably due to the widespread systemic 
effects of ionizing radiation which oper- 
ate through diverse humoral and neural 
factors to produce their effects upon 
behavior. Accumulations of breakdown 
products could account for the aversion 
effects which appear to be a function 
of total dose, while the arousal and cue 
effects seem to indicate a fast respond- 
ing receptorlike action functionally re- 
lated to intensity or dose rate. 
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A reply to Garcia and Buchwald’s elaboration on the stimulus charac- 
teristics of radiation. (a) Whether radiation should be considered as an 
unconscious stimulus is a semantic problem. (b) The arousal properties 
of radiation which have been previously postulated were, however, ex- 
perimentally demonstrated during the period that the review paper was 
in press. (c) To account for the diverse Soviet conditioning data in 
terms of the short-term arousal phenomenon is much too simplified an 


explanation. 


Garcia and Buchwald’s (1964) com- 
ments elaborating on the treatment 
of irradiation as a stimulus deal essen- 
tially with the following problems: 

First, they object to calling radiation 
an “unconscious” stimulus. In review- 
ing Arbit’s (1958) theoretical paper I 
simply paraphrased his description of 
radiation as a stimulus of which the 
animal may not be “aware.” Razran 
(1961) in his comprehensive review of 
Soviet studies on interoceptive condi- 
tioning says: 

Unlike the continuum of exteroceptive stimu- 
lation which is the body-material of all our 
conscious experience, the continuum for intero- 


ceptive stimulation leads largely to uncon- 
scious reaction [p. 97]. 


Several studies, including those of Gar- 
cia and his co-workers (see Garcia, 
Kimeldorf, & Hunt, 1961, for a review), 
as well as a recent study by Barnes 
(1962) have shown that, in condition- 
ing, radiation acts primarily through 
the interoceptors and that exclusion of 
visual and olfactory receptors does not 
affect the phenomena in which radia- 
tion acts as the unconditioned stimulus. 
Whether one wants to refer to radia- 
tion as unconscious stimulus depends 
upon one’s definition of the term. 
Secondly, Garcia and Buchwald 
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(1964) are attempting to account for 
the stimulus properties of radiation by 
pointing out that it leads, as do most 
other conventional stimuli, to arousal 
which may be detected electrophysio- 
logically or behaviorally. That radia- 
tion leads to EEG changes which re- 
semble those usually associated with 
sensory induced arousal has been pre- 
viously suggested by both Soviet (Liva- 
nov, 1962) as well as some Western 
investigators (Gangloff, 1962). Be- 
havioral arousal after low doses, how- 
ever, was not definitely demonstrated 
until a year ago (Hunt & Kimeldorf, 
1962). It is unfortunate that because 
of the long publication lag in the 
Psychological Bulletin during the time 
that my paper was in press a considera- 
able amount of progress had occurred 
in the area affecting some of the pub- 
lished conclusions. In all fairness, how- 
ever, if Garcia and Buchwald call at- 
tention to their own experiments to 
account for some of the previously 
observed stimulus characteristics of 
radiation, they should mention also some 
other recent publications from other 
laboratories which also bear directly 
upon the problem. Barnes (1962), 
Hunt and Kimeldorf (1962), Andrews 
and Peterson (1962), to mention just 
a few studies, have all published perti- 
nent material. 

Finally, to account for the complex 
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and diverse Soviet data on the effect of 
irradiation in conditioning studies in 
terms of the short-term arousal effect 
noted by Garcia and co-workers is much 
too simplified an explanation. Without 
going into any great detail I would 
like to mention just a point or two to 
illustrate the difficulties encountered in 
trying to elucidate the role of radiation 
in conditioning. In all of the radiation- 
induced spatial avoidance conditioning 
studies the effects could not be obtained 
with doses of less than approximately 
100 roentgens. Furthermore, the phe- 
nomenon was always observed either 
immediately following irradiation or 
within a period of less than 36 hours 
following the conclusion of irradiation. 
Low doses have no effect and neither do 
high ones (Andrews & Peterson, 1962). 
In the Garcia, Buchwald, Feder, and 
Koelling (1962) study the inhibition of 
bar pressing occurred in animals which, 
during the conditioning series, were sub- 
jected to an unknown quantity of radi- 
ation. (The authors did not state the 
dose rate during the conditioning series. 
Tf the dose rate was the same as during 
the experiment proper, then the animals 
must have received 400 roentgens.) 

Now in the Soviet studies changes in 
conditioning were observed after ex- 
posures of a few roentgens or less, and 
these changes sometimes lasted for sev- 
eral days after the cessation of exposure 
or even for 1 year in one case (Samoy- 
lova, 1959), Also, the changes involve 
different parameters of conditioning in 
different studies. To state that diverse 
phenomena observed by Soviet condi- 
tioning investigators can be explained 
on the basis that irradiation affects the 
arousal mechanism does not carry a suf- 
ficient amount of explanatory power to 
account for the diversity of changes 
which has been observed. 

It should be noted that in my own 
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review I have stated that the Soviet 
findings are probably explicable in terms 
of performance rather than associative 
changes. As a matter of fact, already 
in my 1956 review I had noted that radi- 
ation affects motivational and emotional 
variables and that the effects should be 
most apparent in performance in which 
the extrinsic incentives are relatively 
small. This gross generalization is prob- 
ably still valid. The details in this area 
of research are rapidly accumulating, 
and Garcia and co-workers have made 
here notable contributions. 

It is unfortunate that the long publi- 
cation lag has made my 1963 review 
paper less up to date than desirable. 
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EXACT MULTINOMIAL 


SIGNIFICANCE TESTS: 


NOTE ON A PAPER BY ALPHONSE CHAPANIS 


N. T. GRIDGEMAN 
National Research Council of Canada, Ottawa 


Table is given of the 19 possible orderings of 12 items randomly dis- 


tributed into 3 categories. 


Consider two indistinguishable urns, 
one of which contains equal and the 
other unequal, numbers of red, white, 
and blue balls. A sample of 12 is 
drawn at random, with replacement, 
from each. Suppose that the results are: 


red white blue 


Urn A Fä 4 4 
Urn B 2 3 7 


Which urn then is the more likely to 
house the uniform distribution? Com- 


mon sense supplies a ready answer. And 
Alphonse Chapanis’ (1962) paper al- 
though primarily concerned, as its title 
indicates, with single-sample work, has 
a table (Number 3, page 309) that 
should furnish a precise answer. Sur- 
prisingly, this answer favors Urn B. 
Intuition is sometimes a fallible guide 
in problems of statistical inference, but 
this sharp clash between the intuitive 
and the formal seems worth looking into. 

An explanation is to be found in the 


TABLE 1 


Exact AND x? 


PROBABILITIES OF THE 19 POSSIBLE Outcomes Wren 12 ITEMS ARE 


RANDOMLY DISTRIBUTED INTO 3 CATEGORIES 


Number of: x? probability 
Exact a 
Outcome Permutations | Arrangements basen ote Uncor- Corrected 
per outcome |per permutation probability rected for á 
continuity 
12 0 0 3 1 .000006 .00001 00003 
1110 6 12 000141 .00010 .00030 
10 2 0 6 66 000886 .00091 .00104 
10 1 1 3 132 .00163 00117 00248 
9 30 6 220 .00412 .00525 00674 
8 40 6 495 .00970 .0183 0267 
921 6 660 0172 00866 .0126 
750 6 792 .0261 0388 .0440 
660 3 924 .0313 0498 0725 
831 | 6 1980 0537 0388 0440 
& 2 2 3 2970 .0704 .0498 0725 
741 6 3960 115 106 135 
651 | 6 5544 178 174 253 
7 2 2 6 7920 .267 174 253 
6 42 6 13860 424 368 417 
552 3 16632 .518 412 606 
6 3 3 3 18480 622 412 606 
S 4 3 6 27720 935 779 883 
44 4 1 34650 1 1 a 
Total 91 


Note.—Totality of arrangements = 531,441. 


239 


240 


puild-up of Chapanis’ Table 3, which 
gives cumulative probabilities for all 
possible “outcomes” in 12-item samples 
from a discrete equipartite trinomial 
population. These outcomes, 19 in num- 
ber, are however cumulatively ordered 
by their gross probabilities, whereas the 
requirement is an ordering by the prob- 
abilities of the component permutations. 
For instance, the outcome 2-3-7 has 6 
permutations; and in terms of the urn 
model they would be: 2 red, 3 white, 


N. T. GRIDGEMAN 


7 blue; 2 red, 3 blue, 7 white; 2 white, 
3 red, 7 blue; etc. 

Table 1 gives the proper ordering. 
The exact test is of course better than 
the x? test, but the discrepancies are 
now fewer and smaller. And there are 
no clashes with intuition. 
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COMPETING THEORIES OF RECEPTOR 


EXCITATION 


IN THE RETINA: 


A SYMPOSIUM 


HARRY HELSON 
Kansas State University 


Introductory remarks to papers by Enoch, McConnell, Boynton, and 
Blackwell in a symposium bearing the above title at the 1962 conference 
of the American Psychological Association in St. Louis. 


The following two papers by Enoch 
and McConnell and the discussions by 
Boynton and Blackwell constituted a 
symposium bearing the above title which 
was held at the Annual Conference of 
the American Psychological Association 
in St. Louis, 1962. In spite of their 
technicality there was an overflow at- 
tendance to hear these papers thus at- 
testing the wide interest in the latest 
work and thinking on mechanisms of 
color vision. Both the main presentations 
and the discussions show how far recent 
visual theory has progressed beyond the 
classical formulations of Helmholtz, 
Hering, and Ladd-Franklin which were 
chiefly designed to correlate phenomeno- 
logical data with supposed retinal proc- 
esses. As these papers SO well show, 
theory has advanced to the point where 
interactions of light and the finer struc- 
tures of the retina and interactions of 


light with photosensitive pigments are 
now in the center of attention, Further- 
more, the adequacy of current theories 
is judged by reference to quantitative 
data obtained by meticulous biophysical 
and biochemical methods as well as by 
their agreement with psychophysical 
measurements. 

There is much more in the papers by 
Enoch, McConnell, Boynton, and Black- 
well than can be conveyed in a short in- 
troduction. Those interested in learning 
about this area, as well as those com- 
mitted to either a physical or chemical 
type of color theory or to some third 
approach, will find these papers worthy 
of study. The Editor of the Bulletin, 
who served as Chairman of the sympo- 
sium, is proud to present material of this 
caliber and importance in its pages. 


(Received March 3, 1963) 
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PHYSICAL PROPERTIES OF THE RETINAL 
RECEPTOR AND RESPONSE OF RETINAL 


RECEPTORS? 


JAY M. ENOCH 
Washington University Medical School, St. Louis 


The author reviews studies of the waveguide characteristics of retina) 
receptors. These properties are present in human photoreceptors. Some 
of the possible roles played by these physical properties are discussed, 
with emphasis placed upon color-vision mechanisms at the receptor level. 
The author also considers approaches toward the determination of the 


role(s) played by the waveguide properties, by the photosensitive pig- 
ments, and by “other” mechanisms in the coding of the signal at the 
detector level. He describes his initial experiments directed toward the 
development of an indicator of response (or marker) in single mam- 


malian retinal receptors. 


In recent years, this investigator has 
described the waveguide characteristics 
of retinal receptors in rats, monkeys, 
and humans (Enoch, 1960, 196la, 
1961b, 1961c, 1963a). The physical 
characteristics of the receptors give rise 
to nonuniform distributions of energy 
in the photosensitive, pigment-bearing, 
retinal receptor outer segments. These 
nonuniform distributions of energy vary 
as a function of wavelength. That is, 
each receptor passes a given wavelength 
band(s) and this energy is distributed 
spatially in the receptor in certain spe- 
cific patterns. Since the incident energy 
is broken up, or separated, into different 
wavelength bands in the same and neigh- 
boring receptors, and since different 
wavelengths give rise to different chro- 
matic sensations, one may ask if the 
waveguide characteristics represent the 
first step in the coding of the chromatic 
aspect of the visual stimulus. Because of 


1 This research has been supported in part 
by Grant B-2168 from the National Institute 
of Neurological Diseases and Blindness, Na- 
tional Institutes of Health, Public Health Sery- 
ice, Bethesda, Maryland. i “er 

is paper was presente as part of the 
oe “Competing Theories of Recep- 
tor Excitation in the Retina” at the meeting of 
the American Psychological Association, Sep- 


tember 3, 1962. 
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the considerable complexity of these 
characteristics, one cannot simply fit 
these findings into classical theoretical 
frameworks. At this point, what is 
needed is not another color-vision 
theory, or theories; rather, we require a 
method (or methods) which will enable 
us to decide which properties define the 
actual response of the retinal receptor. 
A crucial question is: How can we pre- 
cisely determine the role played by wave- 
guide characteristics, the role played by 
the photochemical properties (Dartnall, 
1957; Rushton, 1962; Wald, 1959), and 
the role(s) played by “other” mecha- 
nisms (Ingelstam, 1956),* in the re- 
sponse of the receptor, and in the coding 
of the visual stimulus? The plethora of 
theories of vision and color vision can 
only exist in our vacuum of knowledge 


2While the author may say with confi- 
dence that the waveguide form of energy trans- 
fer is present in retinal receptors, he cannot 
yet say that the distribution observed experi- 
mentally in any given receptor is of necessity 
the same as that which was present during 
life. It is possible that some of the complexi- 
ties are introduced as a result of the experi- 
mental technique. This is another reason for 
the author’s interest in the techniques dis- 
cussed toward the end of the paper. 

3 This reference is one example of a number 
of other mechanisms which have been postu- 
lated. 
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concerning the events occurring between 
the bleaching of the photosensitive pig- 
ment and the activation of the neurologi- 
cal chain (and, ultimately, the visual 
response). 

In this paper, the author will review 
his findings relative to waveguide prop- 
erties present in retinal receptors of 
many species (including human); he 
will discuss the possible role(s) played 
by these properties in color vision; he 
will consider some methods of attacking 
the problem of the response of the single 
retinal receptor; and he will present 
samples of his first efforts directed to- 
ward the solution of this problem. 


THE RETINAL RECEPTOR AS A 
WAVEGUIDE 


The term “waveguide” essentially de- 
scribes the function of the receptor. The 
luminous energy is literally guided, or 
channeled, by the receptor into its outer 
segment, that is, the zone containing the 
photosensitive pigment. Of significance 
are the characteristics of retinal recep- 
tors acting as waveguides, and the wave- 
guide form of energy transfer or trans- 
mission, 

Energy, when transferred in a wave- 
guide, is propagated in the form of cer- 
tain specific waveguide modal patterns. 
A typical modal pattern photographed 
at or near the termination of a cone 
receptor is shown in Figure 1. These 
patterns, which are the result of inter- 
actions of energy in the receptor, are 
described by a series of summations and 
cancellations of the energy. They are 
best observed and photographed in 
monochromatic light. These results may 
be predicted to a considerable extent 
on the basis of mathematical models 
(Snitzer, 1961; Snitzer & Osterberg, 
1961) meeting the conditions of cylin- 
drical dielectric waveguides. 

It is important to realize that these 
patterns extend slightly beyond the cell 


Fic. 1. A typical retinal receptor wave- 
guide modal pattern as photographed in mono- 
chromatic light—A=580 millimicrons. (The 
microscope was focused upon the termination 
of the retinal receptor, the incident energy 
subtended a cone angle at the retina of 3 de- 
grees, and the receptor was well oriented. The 
energy passed through a retinal cone near the 
central fovea of a squirrel monkey. A length 
of 1 micron is indicated.) 


boundary. If the separations between 
the receptors are not sufficient, this en- 
ergy may leak into surrounding cells. 
This could reduce visual resolution and 
influence transmission of energy in neigh- 
boring cells. At this time, it is not be- 
lieved that this interreceptor leak factor 
is too significant in the normal (human) 
retina. However, the author reserves 
judgment on this point until further 
studies are completed. 

One observes these patterns readily if 
some care is taken in preparing the 
specimen. The instrument required is es- 
sentially a schematic eye. One focuses 
an intense source of light upon a piece 
of freshly dissected retina oriented as it 
might be in the eye. It is important to 
limit the angular subtense of the inci- 
dent beam to angles normally subtended 
by the pupil of the eye. The dissected 
specimen, which is placed in a special 
chamber to avoid crushing, is observed 
with a microscope. One focuses the 
microscope at or near the tips of the 
photoreceptor outer segments and ob- 
serves modal patterns emerging from 
the cells after that energy has passed 
through the cells. Samples of typical 
human retinal receptors are shown in 
Figure 2. 
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Fic. 2. Hypothetical human retinal recep- 
tors. (The arrow indicates the direction nor- 
mally traversed by the incident light. The 
cell to the left is a parafoveal cone, and th 
receptor to the right is a rod, The parts i 
the cell are: o, the pigment-bearing 0 
segment; e, the ellipsoid which ts outer 
densely-packed oriented mitochron is mtaing 
external limiting membrane; n, the i 1, the 
The synaptic region is not ae cell nucleus, 
of G. Walls, The Vertebrate p n. Courtesy 


Hills, Michigan, Cranbrook Pr = oe 
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Those experimental factors which vary 
the patterns propagated are the wave- 
length of the light, the angle of inci- 
dence of the incident light, and the plane 
of focus of the microscope. Polariza- 
tion of the incident light has little ap- 
parent effect upon the result. 

Perhaps the best way to summarize the 
phenomena one observes is to describe 
the events which take place when one 
introduces certain variables. First, let 
us consider an experiment where a non- 
foveal retinal sample is illuminated by 4 
perpendicularly incident, narrow wave- 
length band, beam of light. The light 
passes through the retina as it might in 
the living eye. One racks the microscope 
down from a plane of focus above and 
behind the retina. The first entities one 
observes through the microscope are the 
remnants of the pigment epithelial layer. 
The pigment granules are clearly visible, 
particularly in monkeys. Passing down 
past this level, one focuses at, or near, 
the terminations of the rod receptors. 
Here one sees a modal pattern in each 
and every receptor, The types of pat- 
terns observed in reasonably well-or- 
iented receptors fall into certain limited 
distinct categories. In some instances 
one sees interactive forms. That is, if 
physical conditions are such that more 
oe pee 
ee A > Es own the receptor), and 
tenek as does propagate, 
are patterns may interfere with 

other, and one observes the result- 
ant forms. In these cases, if the two dis- 


tributions are proceeding with differing 
velocities down the receptor, one may 
anticipate the presence of essential sand 
ingswave pattems with concentrations 
Energy at certain points along the 7 af 
Or outer segment, These points ot oe 
centration indicate where the “9 
terns are “in step.” Schroeder sic 
of the Radio Corporation of i) 
and Snitzer and Polanyi (19 
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American Optical Company have con- 
sidered possible interactions. 

Between the receptors one often sees 
microfibrils (penetrating for a short 
distance) from the pigment epithelial 
layer. These microfibrils contain pig- 
ment granules. In some receptors the 
modal patterns are much more intense 
than in others. There are two essential 
reasons for this, either the receptor is 
less well oriented, or it is not trans- 
mitting much energy of this particular 
wavelength. The differentiation between 
these two possibilities is made by chang- 
ing the plane of focus and/or the wave- 
length of the incident light. It is also of 
interest that, when focusing on the rod 
terminations, there are reasonably regu- 
larly disposed darkened areas. These 
turn out to be the locations of the cones 
when one continues to rack the micro- 
scope in the same direction. 

When one changes the plane of focus 
in this manner, the patterns usually be- 
come less bright than at the termina- 
tions, their brightness fluctuates, and 
the modal patterns may change. The im- 
portant point is that some pattern is seen 
at almost all planes of focus in the recep- 
tor. This energy probably represents 
that which is leaking out of the recep- 
tors prior to reaching the receptor termi- 
nations, At a certain point one sees the 
highly intense cone terminations. They 
are more intense than the rod termina- 
tions because of the essentially integra- 
tive action of the ellipsoid portions of 
these cells which collect much of the en- 
ergy from the larger diameter cone inner 
segments and guide that energy into the 
smaller diameter outer segments. The 
more oblique the angle of the incident 
light, the smaller is the amount of energy 
which reaches the outer segment. When 
one continues to focus the microscope 
down to still deeper levels, at the point 
where the cone diameters enlarge, one 
often observes one or more modal-pattern 


changes. This is one factor which sug- 
gests that the final transmission char- 
acteristic of the receptor is determined 
in the outer segment itself. 

If on varying the plane of focus of the 
microscope the observed pattern(s) ap- 
pear to shift sidewise in position in the 
field, this indicates a defect in receptor 
alignment in the specimen. The magni- 
tude and direction may be measured. 
Few retinal receptors are perfectly co- 
axial with the microscope—in many 
cases this may be an artifact associated 
with the preparation. 

Tf one continues to rack the micro- 
scope down after passing into the region 
of the receptor inner segments, at some 
point the light breaks up into a myriad 
of small points. Presumably one has 
passed the external limiting membrane. 
Backing the microscope off all the way in 
the other direction, one observes the 
rapid dissipation of the radiating pat- 
terns after passing the receptor termina- 
tions. 

If one now refocuses the micrscope at, 
or near, the receptor terminations and 
varies the wavelength (or color) of the 
incident light, one observes sudden 
changes in modal pattern as well as 
marked variation in relative transmis- 
sion of energy. Similarly, if one varies 
the obliquity of incidence of the incident 
light, one also sees sudden shifts in 
modal pattern. Increasing wavelength 
and increasing obliquity of angle of inci- 
dence often (but not always) produce 
similar changes in modal pattern at the 
same receptor cell outer segment termi- 
nation. 

When one introduces obliquity of 
incidence, one observes a decrease in the 
amount of energy passing all the way 
through the receptors. As the angle of 
obliquity increases, the decreases in ap- 
parent brightness in cones is somewhat 
more rapid than in rods. On the basis of 
innumerable observations, the author 
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feels that the individual rods and cones 
are quite a bit more directionally sensi- 
tive than psychophysical functions sug- 
gest. The latter functions probably 
should be considered as summed group 
effects. 

If white light is used instead of mono- 
chromatic light, many interesting events 
are observed. First, the retina takes on 
a richly variegated, multihued, mosaic- 
like appearance. This effect is related 
both to differences in transmissivity and 
to the changes in modal pattern as a 
function of wavelength. It is clear that 
a physical separation of the incident 
wavelengths takes place. If one changes 
the angle of incidence, the photoreceptor 
transmissive characteristics change as a 
function of wavelength (in addition to 
other changes in modal patterns and 
total transmissivity). If one changes 
plane of focus, one sees different wave- 
length components leaking out of the 
retinal receptor at different levels. 

In summarizing these events, one may 
say that the incident energy is broken up 
in the retinal receptors. Different wave- 
lengths are located or concentrated at 
different spatial positions in different re- 
ceptors and within the same receptor. 
These nonuniform distributions repre- 
sent the input stimulus distributions to 
the individual retinal receptors. A recep- 
tor can only respond to, and hence de- 
tect, that which is delivered to its photo- 
sensitive pigment (s). 

RoLE(S) PLAYED BY THESE 


PROPERTIES IN COLOR 
VISION 


The challenging question which must 
now be considered is: What functional 
role does this form of energy transfer 
play in the visual process? The most 
obvious consideration, and the one we 
shall discuss here, is that of color vision. 
The color-vision problem at this level is 
one of coding the information cigs 
the image by the retinal receptor. Are 


waveguide properties used by the visual 
system for coding purposes? 

We do know that at some point in the 
system a given luminous stimulus may 
be described in terms of three variables. 
However, this does not mean that the 
response is limited to this number of 
variables at the receptor. It has gen- 
erally been held that there are three or 
more different photosensitive pigments in 
cones, and that they provide the neces- 
sary wavelength separation for coding 
purposes. The research reported in this 
paper demonstrates that wavelength 
separation does, in fact, take place physi- 
cally in the receptors. That is, the light 
is broken up into different wavelength 
components located at different places in 
the same, and in different cells, If this 
information is used by the visual sys- 
tem, the manner in which it is used is 
dependent upon the number of different 
photosensitive pigments present and how 
the photosensitive pigment molecules are 
distributed. With the volume of in- 
formation present in the physical dis- 
tribution, it is immediately clear that 
only one type of photosensitive pigment 
is required by the visual system. How- 
ever, this does not necessarily mean that 
only one type of retinal cone pigment is 
present, 

Following this line of reasoning, one 
may ask several questions. What part 
of the coding task takes place in a single 
receptor cell? Considering the trans- 
mission characteristics of these cells, is 
it possible that one cell can accurately 
describe the entire input distribution to 
the nervous system? Is the response of 
any single cell ever independent (Sjo- 
strand, 1958)? How well does the dis- 
tribution of photosensitive pigment cor- 
relate with the physical distribution of 
the energy? 

The separation of the incoming en- 
ergy into various wavelength (color) 
components occurs in at least five differ- 
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ent ways. First the waveguide modal 
pattern often changes in a single cell 
with variation of wavelength. This 
means that the transmitted energy is 
redistributed in the outer segment of the 
cell. In order for this distribution of 
energy to be used for coding purposes, it 
is clear that the cell, acting as a detector, 
must be able to determine that a change 
has occurred. This information could be 
utilized if the photosensitive pigment 
or pigments were nonuniformly dis- 
tributed in the cell. As an alternative, 
the response of the cell might be depen- 
dent upon the location of the bleached 
pigment in the cell, for example, distance 
from the surface membrane, etc. Clearly, 
if such mechanisms exist, changes in 
physical characteristics of the cell could 
lead to visual anomalies through poor 
correlation of the concentrations of light 
energy and the sites of specific photo- 
sensitivity. Also implied, in a mecha- 
nism such as this, is the ability of the 
cell to transmit or to transfer centrally 
the information acquired. How much of 
the entire coding task is achieved in a 
single detector is dependent upon its 
complexity (or sophistication, or intelli- 
gence) as a detector and its independence 
of action. 

A second mechanism for separating 
the incident light into separate wave- 
lengths is mode interaction. The stand- 
ing waves set up in this manner result in 
local concentrations of energy along some 
cone outer segments. Their loci vary 
with wavelength. The only difference 
between this case and the first case 
is that the concentrations of energy are 
probably not so discretely placed in the 
receptor outer segment. The conditions 
for utilization of that information are 
precisely the same. 

The third mechanism by which the 
input distribution of energy is separated 
is related to the selective manner by 
which energy is transmitted in the re- 


ceptor. When white light is focused 
upon the receptor and one looks at the 
terminations of the receptors, some ap- 
pear red, others blue, etc. (Enoch, 
1961a). Thus, different receptors se- 
lectively transmit energy distributions 
of different wavelengths more or less 
efficiently. This is a filterlike effect. 
Snitzer and Polanyi (1961) refer to this 
as “flash-point transmission.” The ef- 
fect is sensitive to angle of incidence 
of the energy at the retina; and, hence, 
the receptor cannot be considered a 
true filter. This filterlike action requires 
only a relatively unsophisticated de- 
tector for its utilization. The main re- 
quirement is that the detector be 
“aware” that its physical characteris- 
tics favor the transmission of a given 
wavelength band. Whether or not this 
information is actually used by the sys- 
tem, this phenomenon must bias the re- 
sult. Again, it must be determined how 
well correlated are the transmissive prop- 
erties and the detecting properties. 

A fourth mechanism arises out of the 
third one. If white light is incident at 
the receptor, and red light is seen emerg- 
ing from the receptor termination, one 
may inquire as to what happened to the 
remaining energy. Obviously, some is 
reflected or scattered, and does not 
propagate, or is not transmitted in the 
receptor. Other energy which is trans- 
mitted leaks out at different points prior 
to the receptor termination. That is, 
different levels within the receptor outer 
segment probably have different wave- 
length compositions. In order to use this 
information one must again consider the 
sophistication of the detector, and the 
response transfer mechanisms. In our 
experimental preparations most of the 
photosensitive pigment was bleached; 
hence, there was little absorption. 

Lastly, the energy propagating outside 
the cell does not come to bear upon the 
photosensitive pigment. The magnitude 
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of this component varies with wave- 
length and modal pattern. The result is 
a filterlike effect which almost certainly 
biases the result to some extent. Myers 
(1962) recently considered this phase of 
the problem. 

One notes that all of these effects are 
present in both rods and cones. Because 
vision dominated by rod receptor re- 
sponse apparently has no chromatic 
component, one cannot say that wave- 
guide mechanisms do not play a role in 
color vision. One may argue simply that 
rods do not have any mechanism present 
which is capable of utilizing the informa- 
tion present. By the same token, one 
cannot claim other than a biasing role 
for waveguide characteristics in cone 
vision because wavelength separation is 
present. Such bias could lead to an 
anomalous visual response. It becomes 
necessary now to design and conduct ex- 
periments which will indicate the true 
role of these phenomena in vision, 

In some well-oriented rod-free central 
foveal areas, one notes that the transmis- 
sive properties of different cells are 
apparently not randomly distributed. 
Rather, at regular intervals a transmis- 
sive pattern appears to be repeated. This 
is particularly evident when monochro- 
matic light is employed. One can argue 
that such patterning of these properties 
indicates probable utilization of these 
characteristics. 


DEFINING THE RESPONSE 
CHARACTERISTICS OF 
THE SINGLE RECEP- 

TOR CELL 


In order to evaluate the role of wave- 
guide characteristics, photochemical 
properties, and other possible mecha- 
nisms in receptor response, it is desirable 
to study the parameters defining the 
response of single receptors. The wave- 
guide characteristics of a given receptor 
are dependent upon the configuration of 
the cell, the diameters of its component 


parts, the index of refraction of its parts 
relative to the surrounding medium, the 
separation of the cell from its neighbors, 
and its orientation relative to the optical 
system of the eye. The presence of, and 
distribution of a photosensitive pigment 
or pigments within single or different 
cells probably, but not necessarily 
(Brindley, 1960; Ingelstam, 1956), de- 
fines the transducer. 

It is important to realize that if a 
photosensitive pigment having peak sen- 
sitivity in the red is measured, this fact 
does not prove that the system is aware 
that this is an effective red detector. 
Similarly, predominantly red transmis- 
sion due to waveguide characteristics, or 
the presence of a red oil droplet which 
acts as a filter in the inner segment (e.g., 
in the pigeon) does not define the coding 
mechanism. One must consider these 
factors in terms of probability state- 
ments. One must seek to correlate re- 
sponse at a level (or levels) central to 
the outer segment with those physical, 
morphological, and chemical factors con- 
cerning the detector which one can de- 
termine. Presumably, those physical 
and/or chemical characteristics which 
have the highest correlation with re- 
sponse (in the normal) define the coding 
mechanism. The solution of this prob- 
lem will not be simple. 

The bleaching of the photosensitive 
pigment bears a functional relationship 
to response, but the bleaching of any 
given photosensitive pigment molecule 
may not result in a response. Some of 
the mechanisms (relating to the possible 
role(s) played by waveguide properties) 
suggested above depend on this possi- 
bility. This is not a new concept. For 
example, Wald (1954) in his compart- 
ment theory makes use of this point. 

In all of our discussions of cell re- 
sponse we have tended (largely through 
lack of knowledge) to overlook the role 
of the secondary mediators of receptor 
response. Since the time of Hecht, we 
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have realized that the bleaching of the 
photosensitive pigment only “triggers” 
retinal response. The response charac- 
teristics of the receptor may be deter- 
mined to a considerable degree by the 
properties of the succeeding chemical re- 
actions and/or physical events. For ex- 
ample, if receptor-cell excitation is medi- 
ated through a chemical reaction (s) 
induced by the breakdown of the photo- 
sensitive pigment, we have no right to 
assume the constituent substances are in 
infinite supply, cannot be (partially, or 
locally, or largely, or temporarily) de- 
pleted, and are uniformly distributed 
throughout the receptor outer segment 
or other critical zone. 

Microspectrophotometry can tell us 
the nature of the photosensitive pig- 
ments present (and probably how much 
of a given pigment has been bleached), 
and this author’s methods will ultimately 
provide information relative to wave- 
guide properties. We now need to turn 
our attention toward the definition of in- 
dicators or markers of individual recep- 
tor response. We must know which re- 
ceptors have been actively responding 
when excited by a given stimulus, and 
in turn, define their properties. It will 
also be important to see how these 
properties differ from those present in 
nonresponding cells. 

Ideally, these indicators must repre- 
sent activity induced by the bleaching 
of the photosensitive pigment. Activity 
at the bipolar level is probably too late 
in terms of defining initial response 
characteristics because of the problem of 
localizing which receptor (s) induced a 
given bipolar cell to respond, and be- 
cause several receptors may participate 
jointly in inducing bipolar activity 
through complex synaptic junctions. 
Even the study of the synaptic zone in 
the receptor may not prove of value. 
Sjostrand (1958) has demonstrated the 
presence of synaptic connections between 


receptor terminations in the guinea pig. 
Thus, we must seek indicators or mark- 
ers of response in the photoreceptor it- 
self, central to the outer segment (so 
that the indicator only minimally inter- 
feres with the evaluation of the proper- 
ties of the outer segment, and so that we 
know that sequential activity has been 
induced), but peripheral relative to the 
synaptic zone. One may approach this 
question through electrophysiology, or 
histochemistry, or biochemistry, or ul- 
trastructive research. 

Hagins, Zonana, and Adams (1962) 
have been able to detect a receptor re- 
sponse electrophysiologically in single 
squid receptors. The morphology of the 
squid retinal receptor is somewhat dif- 
ferent from that in higher species. This 
author hopes that Hagins and his co- 
workers can extend their techniques to 
other species. There have been several 
studies of the histochemistry of rod and 
cone receptors in mammals including 
humans in recent years (Berkow & Patz, 
1961; Cogan & Kuwabara, 1959, 1960; 
Kuwabara & Cogan, 1959, 1960; Pearse, 
1961). Pearse (1961) has been able to 
detect a difference in staining between 
a dark-adapted and light-adapted rat 
retina in the pigment epithelial layer. A 
very important approach using bio- 
chemistry is that employed by Lowry, 
Roberts, and Lewis (1956). Various 
other biochemical studies relating to the 
problem have been conducted (Hanawa 
& Kuge, 1961; Pirie & Van Heyningen, 
1956). Lastly, one may turn to electron 
microscopy to evaluate changes due to 
receptor response. Fernandez-Moran 
(1961) has recently reported some inter- 
esting work in the outer segment. : 

When evaluating receptor response it 
is first necessary to determine techniques 
indicating physical or chemical 


of-state in the recep 
Juminous stimulus. The change of or 
must be validated (Enoch, 1963b). Tha 
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is, an effort must be made to relate the 
change of state to the response system 
through correlation of its response char- 
acteristics, and, ultimately, through the 
determination of the actual physical or 
chemical relationships. Lastly, one must 
determine which characteristics of the 
cell (waveguide, photochemical, or 
other) correlate with the change of state 
for a given set of stimulus conditions. 
These ends must be achieved with mini- 
mum disturbance of physiological condi- 
tions. 


PRELIMINARY INVESTIGATIONS OF 
RECEPTOR RESPONSE 


As a first step in the study of receptor 
response this author turned toward his- 
tochemistry, His first paper on the sub- 
ject has recently appeared (Enoch, 
1963b). The following paragraphs sum- 
marize that work briefly. 

The author has chosen the ellipsoid of 
mammalian retinal receptors for study, 
and he has been able to demonstrate a 
difference in staining in light and dark- 
adapted eyes (Figures 3 and 4).* The 
light-adapted retina may be exposed to 
the light either after being excised, or 
during life, prior to dissection. The rat 
retinas shown in the figures were dark 
adapted for 15 hours prior to dissection, 
both were dissected in dim red light, and 
both were incubated in a special medium 
in the dark. The light-adapted retina 
was exposed to the stimulus for 10 min- 
utes starting about 15 minutes prior to 
dissection. The luminance level of the 
extended stimulus evaluated in the plane 
of the entrance pupil of the rat was ap- 

proximately 10,000 millilamberts (hu- 
man photopic evaluation). The source 
of light was a tungsten filament with 
heat and neutral density filters inter- 


4The author wishes to acknowledge the 
i t given to 
enerous advice and encouragemen e 
him during this work by members of his de- 
partment, particularly by Marguerite Con- 
stant and Bernard Becker. 


JAY M. ENOCH 


Fic. 3. This photomicrograph is an example 
of a dark-adapted rat retina which has been in- 
cubated in a medium containing a tissue-cul- 
ture medium (TC 199), succinate, and nitro- 
blue tetrazolium. (No stain was observed in 
this section.) 


posed. The pupil was dilated. Similar 
experiments have been conducted on 
rabbits and squirrel monkeys. The effect 
is present in both rods and cones, The 
eye may be exposed to light a significant 
period of time prior to dissection and the 
difference in staining still obtained. This 
allows regeneration of a substantial por- 
tion of the photosensitive pigments. 
These experiments have only dealt 


Fic. 4. The second retina of the same rat. 
(The eye was light adapted, after having been 
dark adapted, shortly before dissection of the 
retina from the living animal. The retina was 
incubated in the same manner as the specimen 
shown in Figure 3. The ellipsoids—see Figure 
2—of these rod receptors stained heavily.) 
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with the first aspect of the problem, that 
is, the determination of a technique in- 
dicating a change of state in a receptor 
due to a given luminous stimulus. The 
technique has not yet been fully vali- 
dated. The author hopes that this re- 
action will prove to be the indicator 
which he seeks. Obviously, this step only 
represents the beginning of a rather 
complex problem. If the validity of this 
indicator can be established, it may be 
possible to significantly advance our 
knowledge of coding in the retinal re- 
ceptor. 
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CHEMICAL THEORIES 


DAVID G. McCONNELL 
Institute for Research in Vision, Ohio State University 


The chemical theories of the visual process advanced by Wald, Dartnall, 
and others are compared. Their adequacy for scotopic vision is substan- 
tial, while incomplete. Almost no theory successfully explains photopic 
vision. Evidence that rhodopsin is a light-sensitive enzyme is discussed, 
and it is conjectured that present extraction techniques do not permit 
comparison of in situ and in vitro data, particularly, in the photopic case, 


Brier HISTORICAL BACKGROUND 


The history of research in vertebrate 
vision chemistry is about a century old. 
Heinrich Müller (1851) first described 
the reddish color of the outer segments of 
the retinal rods, In 1876 Boll noted that 
the color disappeared rapidly after re- 
moval of the retina, and before removal 
if the animal were exposed to bright 
light. Kühne (1878) then demonstrated 
that the purplish-red pigment Sepurpur 
or “rhodopsin” could be extracted in a 
solution of bile salt, and that the color 
was stable in the dark but bleached in 
the light. Köttgen and Abelsdorff (1896) 
plotted the differences in absorption 
spectra of bleached and unbleached solu- 
tions of the extract for several species, 
and found absorption maxima for mam- 
mals, birds, and amphibians at about 
500 millimicrons and at about 540 for 
fishes. The corresponding pigments are 
conventionally called “rhodopsin” and 
“porphyropsin.” 

Trendelenburg (1904) used a Nernst 
lamp to bleach solutions of visual purple 
and measured the efficiency of different 
parts of the spectrum in bleaching, 
Though he did not allow for differences 
in energy of the bleaching lights, he used 

the same lamp to determine dark- 
adapted visual sensitivity, and found 
that it compared te with the bleach- 
ing efficiency of the lamp. 
we die made by Selig Hecht (1920, 
1921, 1924), using white light and solu- 
tions containing various ratios of 
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bleached and unbleached rhodopsin, en- 
abled Hecht to determine that the course 
of bleaching could be described by a 
monomolecular equation. 

Fridericia and Holm (1925) and 
Tansley (1931) found that vitamin-A- 
starved rats synthesized visual purple 
less rapidly than normals, and George 
Wald (1933) reported direct evidence 
of the presence of vitamin A in eye 
tissues. In experiments with frog retina 
he found a connection between vitamin 
A and visual purple. He also related to 
their presence a new substance which he 
called retinene, extracted from solutions 
of light-bleached rhodopsin. It remained 
for Morton and Goodwin (1944) to 
prove that this pale yellow carotenoid 
was the aldehyde of vitamin A. Even 
before then, however, Wald (1939) pub- 
lished evidence that the pigment of fish 
retinas (porphyropsin) was related to a 
retinene and a vitamin A different from 
those in the frog. They were named 
vitamin A, and retinene., and their 
absorption maxima (Amax) are both dis- 
placed towards the red about as much as 
porphyropsin is displaced with respect to 
rhodopsin. As recently as 1953, Wald 
argued that only these two visual pig- 
ments existed in the rods of vertebrates; 
this has become known as the rhodopsin- 
porphyropsin theory. 


Tue Warp Monet 


The top structure in Figure 1 is the 
retinene, molecule. Its alcohol is vita- 
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Fic, 1. All-trans and 11-cis (neo-b) isomers of 
vitamin A aldehyde, or retinene. 


min A;. The retinene, structure, and its 
vitamin A, alcohol, have an additional 
double bond in the ring, accounting for 
displacement of their absorption spectra 
to the red. The simplest model for the 
chemical process underlying vision at 
the time of Wald’s discovery of a rela- 
tionship between rhodopsin, vitamin A, 
and retinene involved fragmenting of the 
retinene molecule from the protein group 
of rhodopsin, under bleaching by light. 
Once split off, the retinene was reduced 
to its alcohol, and the vitamin A was 
then somehow reincorporated with the 
protein into the light-sensitive rhodopsin 
complex. This model had a substantial 
appeal, because it equated the visual 
stimulus to bleaching, and the big re- 
maining problem appeared to be how to 
get the disjoined chromophore back into 
union with the protein. Things stayed 
this simple only momentarily, however, 
because Wald and his co-workers dis- 
covered that the bleaching product was 
the all-trans isomer of retinene, while 
the isomer which in proper solution in 
the dark would recombine with the pro- 
tein to yield rhodopsin was the bottom 
structure in Figure 1—11-cis or neo-b 
retinene. Thus, though a pathway was 
found for regeneration of rhodopsin from 
solutions of protein and retinene, it was 
not the same retinene as the photoprod- 


uct. Hence the next problem was to 
determine how to get 11-cis retinene 
from all-trans retinene. These were not 
the only isomers of retinene by any 
means, but happen to be central to the 
theory. 

The explanation provided by Wald 
for the difference in the bleaching prod- 
uct and recombining molecule was that 
the light-sensitive rhodopsin contains the 
cis-isomer only. This is bleached to the 
trans-isomer. Although controversy still 
exists as to the chemical nature of these 
products, there is fair agreement as to 
the sequence. The primary photochemi- 
cal reaction converts rhodopsin to the 
intermediate /wmi-rhodopsin, which is 
stable only at very low temperatures. At 
physiological temperature it is thermally 
converted to meta-rhodopsin, which 
Dartnall (1957) calls transient orange. 
Both Dartnall, representing the Liver- 
pool school of thought, and the Harvard 
group under Wald’s leadership appear 
to agree that no isomerization occurs in 
the primary reaction. Whether isomeri- 
zation occurs thermally in conversion of 
lumi- to meta-rhodopsin apparently de- 
pends both on the species of the animal 
and the orientation of the theorist. The 
Liverpool scheme involves conversion of 
transient orange to indicator yellow, S0- 
called because it changes color with pH. 
The Harvard school disputed this, be- 
lieving that retinene was produced di- 
rectly from meta-rhodopsin by hydroly- 
sis. The problem hinged upon whether 
retinene was actually freed off from the 
protein, as the Harvard group believed, 
or remained linked to the protein 
throughout the visual cycle, as the 
Liverpool group believed. Of equal and 
related importance was whether the link- 
age between the protein and the chromo- 
phore was sulfhydryl, as the Harvard 
group believed, or & Schiff base torme 
by bonding of the aldehyde retinene P 
a protein amino group, 4S favored by the 
Liverpool school. 
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In this complicated schema, we have 
already remarked on the uncertainty of 
where isomerization occurs. This is in- 
trinsic to the question of at what stage 
in this model, if any, does the visual 
process get going? Because they argued 
that attachment of retinene to opsin 
still prevailed in the lumi- and meta- 
stages, but no longer did after hydroly- 
sis, Hubbard and Kropf (1958a) in re- 
marking on this problem have stated that 
visual excitation probably precedes the 
release of retinene and depends upon the 
light reaction itself, or, at the latest, on 
the conversion of lumi- to meta-rhodop- 
sin. However Hagins (1956) showed 
that bleaching is fast enough to precede 
the electroretinogram, and for this and 
other reasons it must be acknowledged 

that the data are still too incomplete to 
permit pinpointing the time of origin of 
the visual sensation. It seems possible, 
however, as will be discussed in more de- 
tail below, that excitation is synonymous 
with light-induced change in the activity 
of an adenosine triphosphatase enzyme 
which the author believes to be one and 
the same with opsin. Once again, how- 
ever, the time scale of this change on the 
order of milliseconds as required to 
describe excitation is presently lacking. 
At the heart of any ultimately ac- 
cepted chemical theory of visual excita- 
tion will certainly be an explanation of 
the mechanism for regeneration of rho- 
dopsin from the bleached product. While 
in solution, Hecht showed, bleaching 
continues to completion and is dependent 
upon duration and intensity of the light, 
in the intact retina bleaching is no longer 
proportional to intensity. Hagins (1955) 
showed this when he demonstrated that 
for very short flashes of light, about 
half of the rhodopsin available in vivo 
is bleached regardless of intensity. Some 
process which is not understood opposes 
the bleaching of rhodopsin in the living 
retina. That this process is concerned 
with the pigment epithelium has been 


suggested in several quarters. Indeed, 
Kiihne had determined as early as 1878 
that contact with the pigment epithelium 
was necessary for the retina to regain 
its color in the dark, after bleaching. 
Hubbard (1956) found that an extract 
of the epithelial pigment which she 
called retinene isomerase was effective 
in converting the all-trans retinene to 
the 11-cis isomer, and that this property 
was significantly greater in the light than 
in the dark. Rushton (1958a) argued 
against the significance of this phe- 
nomenon for the in vivo retina, because 
of the results of an experiment he re- 
ported from his laboratory. Blue and 
yellow lights, matched for equal bright- 
ness by a dark-adapted human subject, 
were equally effective in bleaching 
rhodopsin, as measured by Rushton’s 
apparatus for analyzing light reflected 
from the live retina. Regeneration fol- 
lowing removal of the two lights was 
also identical. Since retinene in solution 
absorbs blue over 10 times more than 
yellow, the blue bleaching should have 
been more overcome by isomerization 
than the yellow. Rushton’s own model 
(1958b) entails an equilibrium between 
retinene and vitamin A, under control 
of an enzyme-coenzyme complex. In 
this regard, his model is very close to 
that of the Harvard group. Since re- 
generation of the bleached photopigment 
is intimately tied to the presence of the 
epithelial pigment, the concept of a diffu- 
sion process between it and the receptors 
has been entertained by the Harvard 
group. Though such a process may exist, 
it is not necessarily the process which 
opposes bleaching. Another interesting 
possibility is that a very rapidly formed 
free radical, such as is known by elec- 
tron spin resonance to be associated with 
the mammalian retinal melanin pigment 
(Cope, Sever, & Polis, 1963) opposes 
bleaching. The possibility of the move- 
ment of charge carriers between the 
receptors and the epithelial pigment 
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cannot be discounted. Finally, work in 
the author’s laboratory has shown that 
thoroughly bleached rhodopsin, as well 
as extracts of bleached retinas, still con- 
tain a strong light-sensitive enzyme 
which recovers substantial activity when 
left in the dark. It is not necessary for 
the rhodopsin to recover its typical ab- 
sorption spectrum in order for the en- 
zyme to recover activity, though more is 
recovered when retinene is available. 

A recapitulation of the Wald model 
appears in Figure 2. Rhodopsin is con- 
verted by light, in the first step, to 
orange intermediates which are hydro- 
lyzed at sufficiently high temperatures to 
all-trans retinene and its protein scotop- 
sin. In the upper of the two horizontal 
reactions, trans-cis isomerization of the 
retinene occurs, under the control of 
retinene isomerase and light. In the 
lower reaction the trans-vitamin A is 
isomerized. The oxidation-reduction ex- 
changes of hydrogens between retinene 
and the alcohol are controlled by alcohol 
dehydrogenase and the coenzyme DPN. 
Ultimately, the 11-cis isomer of retinene 
combines in the dark with opsin to 
resynthesize rhodopsin as fast as it is 
broken down by bleaching. 

Comparison of the density spectrum 
of human rhodopsin with spectral sensi- 
tivity for the dark-adapted human eye 
has been carefully made by Crescitelli 
and Dartnall (1953), and exhibits a 
rather good correspondence. This agree- 
ment, and the absence of other photo- 
pigments from solutions extracted in the 
dark from rod outer segments, has made 


Rhodopsin 


orange Intermediates 
ll-cis retinene +opsin -25 oll- trons rotinene + opsin 


n-cis vitamin A -2 oll- trons vitamin A 


Fic. 2. Simplified diagram of the Wald model 
of the visual cycle. 


it very difficult to suppose that another 
molecule beside rhodopsin is responsible 
for human scotopic vision. For human 
photopic vision, however, no model is 
available which even approaches coher- 
ence between chemical and psychophy- 
siological facts. 


How Many VISUAL PIGMENTS? 


In the chicken retina, Wald (1937) 
was able to extract along with rhodopsin 
a very small amount of a pigment with 
Amax Of 562 millimicrons. On the basis 
of later experiments Wald, Brown, and 
Smith (1952, 1953) concluded that this 
pigment—iodopsin—consists of the same 
retinene cis-isomer as rhodopsin con- 
tains, but, instead of the rod protein 
scotopsin, a cone protein which they 
called photopsin. Furthermore, just as 
scotopsin will combine either with the 
retinene, cis-isomer to make rhodopsin 
or with the retinene. cis-isomer to yield 
porphyropsin, photopsin should combine 
with either the retinene, or retinene, cis- 
isomers to give iodopsin or “cyanopsin,” 
respectively. Cyanopsin, then, is the fish 
analogue of the chicken cone pigment, 
iodopsin. In this scheme, there is one 
rod pigment and one cone pigment in 
any animal, though pigment pairs may 
differ from species to species. Crescitelli 
(1958) has published data on the dis- 
tribution of the Amax of bleachable pig- 
ments from the retinas of a number of 
vertebrates. There appears to be very 
little evidence from his study that inter- 
species differences can be categorize 
according to the Wald rhodopsin-por- 
phyropsin theory. Nor does it appear 
that in any animal a cone pigment am 
a rod pigment can be readily differenti- 
ated by easily applied extraction t 
niques. Wald’s categorization of ra 
bleachable pigments has been or, 
for its failure to apply rigorous tests 0 
the homogeneity of the pigments. D 
some cases, homogeneity can be argue 
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from the physical chemistry of the 
bleaching curves, after extraction. In the 
case of the cone pigments, however, 
miniscule quantities are probably ex- 
tracted, if at all, and they are easily con- 
fused with impurities. Moreover, it is 
not at all certain that all retinal photo- 
pigments bleach in the light, even though 
they may be photosensitive. Nonethe- 
less, most investigators are in agreement 
with an important (though certainly 
informal) generalization of Wald’s rho- 
dopsin-porphyropsin theory—that all 
visual pigments are members of some 
homologous chemical series, We may 
recall that porphyropsin, to illustrate, 
differs from rhodopsin merely in the 
presence of a single additional double 
bond in the ring, Extension of the con- 
jugation in this or some other manner 
is expected to displace absorption toward 
the red. On the other hand, interrup- 
tion of the conjugated system in some 
manner may be expected to shorten the 
wavelength of the chromophore. Dart- 
nall (1952) pointed out that when the 
absorption spectra of a large number of 
visual pigments are slid along the hori- 
zontal axis (wavelength) so that their 
absorption maxima coincide, there is re- 
markable agreement among them. Yet 
these considerations alone hardly con- 
stitute a sufficient basis for acceptance of 
the homologous series generalization. 
Perhaps the greatest arguments for such 
a theory are parsimony and the existence 
of analogues iw other ubiquitous biologi- 
cal systems such as the porphins, 


New APPROACHES 


Refinements in knowledge and control 
of biological chemical systems frequently 
depend in part on in vitro analysis of 
extracted material such as we have just 
described in the case of rhodopsin, and 
in part on synthetic models. ‘Thus far 
this has been true with the visual pig- 
ments and promises to be more so in the 
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future. Organic synthesis played a major 
rôle, often ignored, in Wald’s orderly 
development of our knowledge of the 
retinenes. Synthesis also was prominent 
in the resolution of the difference of 
opinion—to the extent that it has been 
resolved—between the Harvard and 
Liverpool schools on the mode of at- 
tachment of the chromophore to the pro- 
tein. This controversy arose from prob- 
lems connected with indicator yellow— 
one of the products of the bleaching of 
rhodopsin. Figure 3 shows the synthetic 
compound retinylidenemethylamine, an 
indicator yellow analogue formed as th 
Schiff base of all-trans retinene, an’ 
methylamine, In the form shown iv 
Amax in ethanol is 365 millimicrons. Upo: 
acidification, Amax increases to 440 milli- 
microns. Pitt, Collins, Morton, and Stok 
(1955) attributed the acid shift to proto- 
nation of the nitrogen. In the case of 
true indicator yellow, their theory would 
replace the methyl group with protein— 
ostensibly scotopsin. Both the Harvard 
and Liverpool groups now appear to 
accept the Schiff base as the type of bond 
between retinene and opsin. It must be 
pointed out, however, that the acid shift 
found with such compounds is not in 
itself big enough to account for the 
rhodopsin shift of 120 millimicrons re-i 
ported by Hubbard and St. George 
(1958). Moreover, these Schiff bases do 
not bleach with light. l 
A number of theorists have suggested 
that differences in absorption maxima 
of various visual pigments may be at- 
tributed to different degrees of extension 
of the conjugation of the chromophore 
into the protein. The identification- of 
appropriate amines in the protein re- 


So NaNO CH=NCH3 


Fic. 3. Retinylidenemethylamine, the Schiff 
base of retinene and methylamine. 


_ detected in 
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mains a formidable task. Most investi- 
gators have assumed, like Dartnall 
(1957), that the principle components of 
opsin—with Amax around 290 millimi- 
crons—are such amino acids as tyrosine 
and tryptophane. The organic chemist 
Oster (1958) has stated that the only 
protein residues identifiable -by their 
ultraviolet spectra are tyrosyl and phen- 
ylalanyl, and no additional conjugated 
structures. He regards extension of con- 
jugation by the protein as therefore un- 
likely. In the author’s laboratory, how- 
ever, it has been shown that a number 
aof nucleoside phosphates—derivatives of 
„nucleic acids—have sharp absorption 
peaks in this general vicinity. Moreover, 
their involvement in the visual process 
‘has been indicated by the recent dis- 
covery (McConnell & Scarpelli, 1963) 
that rhodopsin extracts contain sb- 
stantial amounts of a light- and wave- 
length-sensitive enzyme which splits off 
phosphate from nucleoside triphosphates 
—preferentially adenosine triphosphate. 
The liberation of each phosphate in the 
typical reaction of this kind releases up 
to 11,000 calories of energy per mole of 
adenosine triphosphate, and may pro- 
vide the energy for driving other reac- 
tions involved in the propagation of the 
visual sensation. 

As @ compromise in the controversy 
over whether the retinene-opsin binding 
was Schiff base, as claimed by the Liver- 
pool group, or sulfhydryl, as Wald had 
earlier claimed, Dartnall (1957) pro- 
posed that the sulfhydryl groups Wald 
bleached solutions were ex- 
posed by the uncovering of the opsin 
during isomerization of the retinene. In 
this way the Schiff base bond is allowed, 
put the appearance of additional sulf- 
hydryl groups upon bleaching could still 
þe explained. In our own laboratory it 
has recently been shown that opsin con- 
tains sulfhydryl groups which behave, 
jn the presence of known blocking 
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reagents, ver a 
ciated with the oe i. oe 
activity of i phosphatsg 

y of myosin—the muscle protei 
It also appears that the activi EA 
; e ivity of opsin 
ra ee of adenosine triphosphate 
ci y removal of retinene i 

enhanc i A . ar07 JS 
SEA br nakon al S 
be opsin, and the function of the ooraphes 
may be twofold: as a cofactor for the 
efficient splitting of adenosine triph 4 
phate, and as a light-trap sen ihe 
enzyme is illuminated. It is not yates 
at the present time to prove or disprove 
the “unmasking” proposal of Dartnall. 
However, if it is true, the increase in 
sulfhydryl groups brought on by bleach- 
ing would have to reduce the activity of 
the enzyme, at least up to a certain 
point, in order to be consistent with our 
data. 

At Michigan State University, Bar- 
nett Rosenberg (1962) has reported ex- 
periments on semiconductors made of 
two sheets of conductive glass between 
which lies a glasslike preparation of B- 
carotene—the prototype of the chemical 
family to which retinene belongs. These 
experiments are of great interest be- 
cause results so far indicate the pos- 
sibility that a single chromophore of 
this chemical family may give a different 
photoconductive response for every 
wavelength absorbed. When a voltage is 
impressed across the sandwich, incident 
light causes a photocurrent the magni- 
tude, polarity, and distribution in time 
of which are determined by the wave- 
length of the incident light. In Figure 4 
are plotted on the same time axis the 
photocurrents produced by four dif- 
ferent lights in a g-carotene glass cell 
across which was applied 300 millivolts. 
Note that the polarity of the response to 
yellow and red lights is opposite to that 
of the blue and green responses. Proper 
adjustment of intensities of any two 
lights which separately produce opposite 
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ON OFF 


Fic. 4. Photocurrents resulting from appli- 
cation of lights of four different colors to a 
B-carotene preparation. (Ordinate: current; 
abcissa: time. Courtesy of B. Rosenberg.) 


polarities of response produces a re- 
sponse to both lights, simultaneously 
presented, which is comparable to the 
response to a single light of intermediate 
wavelength. Although the particular re- 
lation obtained between wavelength and 
resulting photocurrent holds only for the 
restricted geometry of the sandwich he 
chose, the implications of this research, 
like that of our enzyme research, for 
psychophysical theory are large and in- 
escapable, Rosenberg has already tenta- 
tively related his data to those of Svae- 
tichin (1956) and of MacNichol and 
Svaetichin (1958), from which it ap- 
peared that insertion of microelectrodes 
to certain positions in isolated fish ret- 
ina permitted recording of a response 
which reversed polarity as wavelength 
increased. It must be remarked that 
MacNichol and Svaetichin showed the 
probable location of the microelectrode 
tip for this particular kind of response 
as being in the inner part of the inner 
nuclear layer, relatively far removed 
from the photopigment. Such reversals 
of polarity could be explained by neural 
hypotheses as well as by chemical ones. 
And as might be expected, psychologists 
and electrophysiologists have already be- 
gun to make war on Rosenberg’s infer- 
ences. Nonetheless, his data demand 
being related to the possible chromo- 
phore in the living retina, and persone 
approaches this task with refreshing 


independence. 
e other facets of his theory, while 


not particularly upsetting to psycholo- 


gists, are challenging to previous models 
for vision. The possibility must be con- 
sidered that bleaching, including isom- 
erization, does not supply the initial 
energy for the visual process at all, but 
plays a nonetheless important role in 
removing from the range of energetic 
excitability large chunks of the photo- 
pigment. So regarded, bleaching would 
be superfluous for photopigments with 
relatively high activation thresholds, 
and we should therefore not be sur- 
prised by the presence in the retina of 
unbleachable visual pigments, probably 
in cones. In such systems, whether they 
bleach or not, the energy would be 
photoconductive and continuous. If, as 
Rosenberg suggests, it is also wavelength 
dependent, an almost indefinitely large 
variety of responses could be expected 
from one lowly receptor. 

Our enzyme research shows that when 
opsin is extracted from bleached outer 
segments of cattle retinas, in which the 
un-isomerized retinene must be consid- 
ered at a minimum, the activity of the 
enzyme is still strong, discriminates light 
from dark, but lacks the wavelength 
sensitivity of rhodopsin. This is re- 
gained in the presence of added retinene. 
Electron micrographs show the site of 
this enzyme to be the outer segment of 
both rod and cone (Figure 5). If the 
color sensitivity of a single receptor is 
adequately represented by the extract 
we work with—an assumption always in 
question but never disproved—a single 
receptor would be capable of color dis- 
criminations. 

Lastly, let us consider briefly the ap- 
proach of Rushton (1955, 1958c) who 
directs light at the retina in the living 
organism, and measures changes in the 
density of the bleachable pigments by 
comparing the reflected energy of the 
bleaching light with that of a nonbleach- 
ing light. This is a difficult measure- 
ment, particularly in some species which 
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Fic. 5. Artist’s representation of electron- 
microscopic distribution of adenosine triphos- 
phatase in outer segments of red rod (left) 
and principal cone (right) in the frog retina. 
(Courtesy of D. G. Scarpelli and E. L. Craig.) 


are heavily pigmented, including man. 
The density of rhodopsin at various 
points on the dark-adapted retina as 
determined by this method shows good 
correspondence with the histological dis- 
tribution of the retinal rods. Both 
rhodopsin and rods are, as expected, 
absent in the fovea and the optic papilla. 
Like the correspondence reported by 
Crescitelli and Dartnall between spectral 
sensitivity of the dark-adapted human 
eye and the density spectrum of human 
rhodopsin, these data strongly imply one 
dark photopigment—rhodopsin. 

In the photopic case, Rushton re- 
ported identification of two bleachable 
pigments—chlorolabe and erythrolabe, 
whose Amax ate in the green and red, 
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respectively. Chlorolabe, moreover, was 
the only pigment shown by this tech- 
nique in the retina of the protanope, 
since the difference spectrum obtained 
was independent of the wavelength of 
the bleaching light. The difference spec- 
trum also agreed well with the protan- 
ope’s spectral sensitivity. An examina- 
tion of the fovea of a deuteranope, and 
another of a deuteranomalous fovea, dis- 
closed only about 10% of the pigment 
density of the protanope—too little on 
which to make determinations. Identi- 
fication of the third pigment generally 
accepted as required to make color 
matches has not yet been reported by 
Rushton. 

The method, despite its cleanness, 
suffers from two obvious limitations. One 
is psychological: threshold determina- 
tions are made by the traditional verbal 
report of the subject—which it would be 
difficult to represent to a chemist as 
indicating absorption or nonabsorption 
of light. The second is that the method 
applies only to pigments which will 
bleach. Whether the blue cone pigment 
is unbleachable in the human retina, or 
some other problem prevents its identi- 
fication by this method, is not at present 
clear, Suggestions that the blue-sensi- 
tive pigment is actually the rod pigment 
have not met with much favor among 
most investigators. Our own finding that 
a photostable pigment with Amax about 
420 millimicrons regenerates from cattle 
or rabbit opsin and retinene, together 
with Dartnall’s report (1960) of a pig- 
ment extracted from the pure-cone ret- 
ina of the gray squirrel which had an 
absorption maximum at 502, are only 
two of several contrasting facts which 
require that the question be left entirely 
open.* 

P A ipt was written, Marks 
Gan ee Mar Nichol (personal com- 


munication, 1963) have reported SOE ade 
spectra for red, green and blue isolated, 
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The differentiation, in the same ex- 

tract, of scotopic and photopic pigments 
appears to this author to be at one and 
the same time the most frustrating and 
the most promising avenue of research 
into the chemistry of the receptors. Evi- 
dence is now mounting that the system 
as we find it in extracts is a far cry from 
that in the receptor. A particularly im- 
portant example is illustrated by our 
present inability to extract from the 
outer segments elements of the cyto- 
chrome apparatus which we know from 
light (McConnell, in press) and electron 
histochemistry is present to substantial 
degree in these structures. There is 
excellent correlative reason to believe 
the cytochromes are intimately linked 
with the adenosine triphosphatase en- 
zyme which we can extract. Moreover, 
it is not too early to suggest that these 
cytochromes may be associated with the 
photopic color-sensitivities of the retina. 
Development of an appropriate extrac- 
tive technique is of pressing concern. 
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DISCUSSION: 
COMPETING THEORIES OF RECEPTOR EXCITATION 


R. M. BOYNTON 
University of Rochester 


This discussion is mainly an examination of a theoretical idea suggested 
by Enoch and McConnell. They suggest that a single photoreceptor may 
be able to respond so that the output of the receptor is “tagged” accord- 
ing to the wavelength of the quantum absorbed. This idea stands con- 
trary to the classical trichromatic scheme, according to which the proba- 
bility of quantum absorption depends upon the energy of the incident 
quantum (inversely related to wavelength), but where once a quantum is 
absorbed all information pertaining to the wavelength associated with that 
quantum is forever lost. The trichromatic scheme is reviewed, together 
with the evidence which supports it. Difficulties raised by some of the 
suggestions of Enoch and McConnell are then evaluated in this context. 


I wish to comment particularly upon 
a controversial theoretical idea that has 
been seriously proposed by McConnell, 
and which also has been mentioned by 
Enoch. The suggestion is that a single 
receptor in the eye may be capable of 
generating a differentiated spectral re- 
sponse, one that is “tagged” in some way 
so that its response carries with it in- 
formation about the wavelength of the 
light absorbed. In other words, it is pro- 
posed that a single receptor may “know” 
what the wavelength of the incident light 
is. It is also implicit in this argument 
that the receptor be capable of transfer- 
ring this differential information about 
wavelength to higher levels of the visual 
system. 

These speculations stand diametrically 
opposed to a general principle of color 
discrimination which may be called the 
“trichromatic scheme.” According to 
this principle, the responses of receptors 
are not tagged according to the wave- 
length of the light that excited them. 

I have decided to make it my principal 
responsibility in this discussion to: (a) 
describe what the trichromatic scheme 
is; (b) indicate the kind of experimental 
evidence that has made it the favorite 
scheme of the color theorist for more 
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than a century; and (c) with a and b 
as background, point out the implausi- 
bility of the tagged-response theory of 
color discrimination. Before doing so, 
however, it will be useful first to consider 
the simpler case of rod vision. 


Rop VISION 


When the maximum luminance level 
of environmental objects is low—below 
about 5 or 6 microlamberts—there is no 
color discrimination, and all vision (ex- 
cluding the effects of memory color) is 
black and white. The generally-accepted 
reason for this is that scotopic vision is 
mediated by only one class of receptor— 
the rod. Either all rods have the same 
spectral sensitivity, or, if they do not, 
there is no separation of the signals 
transmitted by those having one type of 
spectral sensitivity and those having an- 
other. In either case, the stimulation of 
rods by light of any wavelength leads 
to the same sensation of scotopic white 
which is produced by stimulation by any 
other wavelength, providing that rela- 
tive intensities are suitably adjusted. 
When many quanta are incident upon 
the eye, the resulting sensation depends 
simply upon the rate at which quanta are 
being absorbed by a collection of rods 


DISCUSSION 


(for any fixed condition of adaptation). 
Not only does the rod not know where, 
along its length, the absorption of a 
given quantum occurred, but the optic 
nerve fiber to which a collection of rods 
is synaptically connected has no informa- 
tion concerning in which particular rods 
the quanta were absorbed. Moreover, 
and this is most important for the pres- 
ent argument, the receptor also does not 
know what the wavelengths of the ab- 
sorbed quanta were. 

Whether or not a given quantum will 
be absorbed is unpredictable, being a 
matter of probability. Some quanta, 
however, stand a much greater probabil- 
ity of absorption than do others, de- 
pending upon their energy. For example, 
suppose that the maximum probability 
of a quantum being absorbed in some 
receptor is about .20. This maximum 
probability will occur at the peak wave- 
length of scotopic sensitivity, at a wave- 
length of about 505 millimicrons. At 
this wavelength, a quantum has an en- 
ergy value of slightly less than 4x 107° 
erg (see Table 1). Any quantum pos- 
sessing more or less energy than this is 
less likely to be absorbed. For example, 
a quantum having an energy of 3x 107? 
erg (corresponding to about 660 milli- 
microns in the red) will stand less than 
one chance in 10,000 of being absorbed. 


TABLE 1 


EQUIVALENT WAVELENGTH, ENERGY PER QUAN- 
TUM, AND ASSOCIATED Cotor SENSATIONS 
FOR THE NEUTRALLY-ADAPTED EYE 


Wave- Color 


Energy per } 

length quantum sensation 

400 4.96 x 10™ erg Violet 

450 4.41 x 10” erg Blue (slightly 

reddish) 

500 3.97 x 10 erg Bluish green 

550 3.61 x 10™ erg Yellowish green 

600 3.31 x 10™ erg Orange 

650 3.06 x 10™ erg Red 

700 2.83 X 10™ erg Red 
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However—and this is the main point— 
once absorption has taken place, the 
effect produced by one quantum is the 
same as that produced by another. The 
molecule of rhodopsin that absorbed the 
quantum will have reacted in an all-or- 
none manner without receiving any in- 
formation about what the energy of the 
quantum was. 

The psychophysical evidence in sup- 
port of the point of view just presented 
lies mainly in the experimental work 
underlying the relation: 


JE V^ SEV". 


Here V’, is the scotopic luminous effi- 
ciency function, and Ey) and En are 
taken to be any two distributions of 
radiance in fields which match. For the 
standard observer, any two fields which 
satisfy this equation will match, all will 
have the same scotopic white color, and 
all potential information about the hue 
and saturation of objects is lost. 


THE TRICHROMATIC SCHEME 


The main point about the trichromatic 
scheme that I wish to emphasize is that 
it implies the same assumptions for cone 
vision that have just been reviewed for 
rod vision, except that in this more com- 
plex case there must be three classes of 
photopigment. One class has its maxi- 
mum absorption in the red, meaning that 
quanta having an energy of about 
3.3x 107? erg are most likely to be ab- 
sorbed, with the probability falling off 
for quanta of higher or lower energy. 
Another class has its maximum proba- 
bility of absorption at about 3.7% 10 
erg in the green, with the third at about 
4.6x 107? erg in the blue. ^ 
quantum (say in the green) ml 
absorbed by any one of the tł 
of photopigment, but is more likely to be 
absorbed in the “green” kind than 1m 
either of the others. Once absorbed, a 

ser Of by which kind of pigment, the 
gardless by W, i is located 
receptor in which that pigment 35 
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does not know: (a) what energy level 
that quantum possessed, (b) in which 
particular molecule along the length of 
its outer segment the quantum happened 
to be caught, or (c) in which type of 
pigment molecule the absorption took 
place (if, as is theoretically possible, 
there should be a mixture of pigment 
types within the receptor). 

The trichromatic scheme is not con- 
cerned with details about the processing 
of chromatic information beyond the 
level of photopigments, but perhaps a 
few comments about the great beyond 
are nevertheless in order. The trichro- 
matic scheme is much more restricted in 
its scope than is a complete trichromatic 
theory of color vision. The simplest 
trichromatic theory states that there are 
three cone types (one for each pigment) , 
three classes of optic pathway (one for 
each receptor), and that color sensations 
depend somehow upon the blending of 
these three classes of signal at the brain. 
One can, however, be a supporter of the 
trichromatic scheme and nevertheless 
adopt much more complicated ideas con- 
cerning what happens at subsequent 
stages in the visual system. For example, 
the evidence for an “opponent colors” 
transformation is now very strong, but 
the idea of such recoding of input in- 
formation does not in any way alter the 
validity of the concepts involved in the 
basic trichromatic scheme. I have com- 
mented more extensively on this else- 
where (Boynton, 1960), 


EVIDENCE SUPPORTING THE 
TRICHROMATIC SCHEME 


The one overwhelmingly important 
reason why the trichromatic scheme has 
been so popular is that it easily accounts 
for the remarkable facts associated with 
color mixture. Briefly stated, color mix- 
ture concerns the fact that there are 
many pairs of physically different eames 
which, when directly compared, matc 
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exactly for hue, brightness, and satura- 
tion—that is, for color. Such pairs are 
called metameric pairs, or metamers. 
When a systematic effort is made to de- 
termine the number of variables that 
will allow the normal observer to match 
any color, the number turns out to be 
exactly three—no more, and no less. 
Thus any color may be matched by a 
suitable mixture of exactly three pri- 
mary colors, with the choice of primaries 
being arbitrary within these bounds: (a) 
no mixture of two of the primaries can 
match the third, and (b) in certain cases, 
one of the primaries may be added to 
the side of the colorimetric matching 
field that contains the color to be 
matched, being treated as if it were a 
negative amount of light added to the 
other side. 

Another important series of facts 
about color vision are known as “Grass- 
man’s laws.” These can be variously 
stated, but I think this is the simplest 
way: there exists an approximate iso- 
morphism between the empirical system 
of color-matching experiments and the 
formal system of ordinary algebra. If we 
replace the “=” sign of algebra by the 
“matches with” operation of colorimetry, 
the “+” sign of algebra by the optical 
superposition of lights in the colorimeter, 
and if we take the “—” operation to im- 
ply, as previously noted, that the stimu- 
lus in question is on the “wrong” side 
of the matching field, then (to the ex- 
tent that Grassman’s laws are really 
true) the formal and empirical systems 
are isomorphic. This permits one to cal- 
culate many possible experimental re- 
sults without doing each and every ex- 
periment. It also becomes possible to 
transform from one system of primaries 
to another, The facts of color mixture 
have been collated and standardized in 
an internationally recognized colorimet- 
ric system, in which the Commission 
Internationale de lÉclairage chromatic- 
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ity diagram is perhaps the crowning 
achievement (LeGrand, 1957, Ch. 8). 
In this rather widely-misunderstood dia- 
gram, all stimuli which are metameric 
with respect to each other plot at a com- 
mon point, all color mixtures fall along 
straight lines, and the spectral colors 
appear along a partially-closed curved 
locus within which all real mixtures must 
lie. 

The trichromatic scheme accounts, at 
one fell swoop, for the whole constella- 
tion of facts about color mixture that 
have just been described. It also ac- 
counts, if one assumes the pigment con- 
centrations to be low, for the resistance 
of metameric matches to alteration by 
adaptation or changes in luminance level. 
The three-pigment hypothesis of the 
trichromatic scheme accounts for these 
facts as follows. If two stimuli match 
exactly in all respects, then it is supposed 
that the rate of quantum absorption by 
each of the three kinds of photopic visual 
photopigments is exactly the same for 
both stimuli, Unless something happens 
to alter the spectral sensitivity exhib- 
ited by the receptors containing these 
pigments, any pair of metameric stimuli 
behave as they do simply because they 
exert exactly equivalent effects at this 
very peripheral level of the visual sys- 
tem. They are indeed “physiologically 
identical” and from this point on, it 
matters not what complex transforma- 
tions may occur in the encoding of the 
information, how many receptor types 
there are, or how much of which pigment 
is in each receptor. 

Now both Enoch and McConnell have 
speculated that the single receptor may 
not merely instigate an undifferentiated 
signal correlated with the quantity of 
absorbed light, but that the job of wave- 
length discrimination may be handled in 
part by, and within, the individual re- 
ceptor. Rosenberg’s model, for example, 
suggests that the sign and temporal char- 


acter of the response of a given type 
of photopigment might depend upon 
wavelength. The model is completely 
antithetical to the somewhat updated 
view of the classical scheme that I have 
tried to describe here and, it seems to 
me, raises far more questions than it 
answers. For example: 


1. Exactly how can the trichromacy 
of human color vision be accounted for 
by this model? 

2. How can the information related to 
wavelength be retained and transmitted 
to the higher visual centers, along with 
information related to intensity? 

3. Where do Grassman’s laws come 
from? 

4, Why is no information about wave- 
length transmitted by the scotopic sys- 
tem? 


Another body of experimental data 
which has been to some extent accounted 
for by the trichromatic scheme, though 
not without difficulty, is chromatic adap- 
tation (MacAdam, 1961; Stiles, 1961). 
Time simply does not permit me to go 
into the matter. Let me say, however, 
that I cannot see any way that the 
Rosenberg type of model can even begin 
to account for the experimental facts. 


DIFFICULTIES RAISED 
BY EnocR’s WORK 


I would like now to consider some of 
what Enoch has said, from the stand- 
point of the trichromatic scheme. After 
considering the possibility that the indi- 
vidual receptor might generate a dif- 
ferentiated response depending upon 
wavelength, he has admitted that the 
undifferentiated response ig much more 
probable. But there are problems for the 
trichromatic scheme raised by his im- 
portant experiments concerning wave 


guide effects in receptors, from which 


work it seems quite clear that the trans- 
mittance of a given retinal cone 1S by no 
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means independent either of the wave- 
length or the angle of the incident light. 
Such waveguide effects would have the 
same consequence, in principle, as the 
interposition of directionally-sensitive 
color filters in front of the separate re- 
ceptors. It is quite correct that cones 
with three such filters and only one kind 
of photopigment could conceivably do 
the job of color discrimination very well 
and be entirely consistent with the tri- 
chromatic scheme. The key question, 
which Enoch has implied but perhaps 
has not stated straight out, concerns 
whether the selective wavelength effects 
that he observes are sufficiently stable 
and of the right quantitative sort to al- 
low these to serve as the “filters” of 
human receptors containing only one 
photopic photopigment. To me, this 
surely seems unlikely: first, there is the 
experiment of Brindley and Rushton, 
which Enoch mentioned, where mono- 
chromatic light was put through the 
sclera to stimulate the receptors from be- 
hind, with no color change reported 
when compared to ordinary stimulation. 
One can also observe a similar lack of 
gross color change when observing mono- 
chromatic scattered light in the eye, or 
that reflected near the optic disc. Fi- 
nally, the effects of change of angle of 
incidence and wavelength are very simi- 
lar, so far as the waveguide modes are 
concerned, so that it is difficult to under- 
stand how the receptor could ever sort 
them out. 
On the other hand, there are a number 
of instances which I have deliberately 
side-stepped to this point, where the 
trichromatic scheme breaks down. One 
of these concerns the failure of meta- 
meric matches to hold at high levels or 
after intense adaptation, Another con- 
cerns the inability of subjects actually to 
make perfect color matches since such 
matches are often not perfect, even 
though the best match that the subject 
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can make is highly reproducible, as psy- 
chophysical judgments go. A third fail- 
ure of the trichromatic principle con- 
cerns the Stiles-Crawford effect of Type 
2 (Enoch & Stiles, 1962), which requires 
the postulation of a change of spectral 
sensitivity in at least one mechanism 
depending upon the angle of incidence 
of light upon the receptor although the 
change required is not large. A fourth 
problem concerns the failure of some of 
Grassman’s laws to hold, particularly in 
peripheral vision, Such effects as these 
seem to be variations on the main theme. 
They seem to be saying: “The trichro- 
matic scheme is almost true, but actually 
it is only an approximation. There must 
be complications.” Enoch’s work shows 
us where some of these complications 
probably lie. 

The facts of color mixture depend 
upon the stimulation of many thousands 
of receptors. Color discrimination is very 
poor in small fields and improves with 
area at a much faster rate than lumi- 
nance discrimination (MacAdam, 1943). 
If there were no correlation between the 
“filtering” action caused by variations 
in the dimensions of the tiny recep- 
tors and the photopigments contained 
therein, then psychophysical investiga- 
tion with reasonably large areas would 
give us information closely related to 
photopigment sensitivities, despite the 
complications introduced by the wave- 
guide effects. The only requirement is 
that the numbers of receptors be suf- 
ficiently large for random effects to can- 
cel out. This suggestion, which is one of 
the possibilities considered by Enoch, 
seems most probable to me. It is com- 
patible so far as I can see with other 
evidence. 

One can imagine that Nature said, as 
she went along her evolutionary way, 
Let’s build a color-discriminating system. But, 
in order to keep the good spatial resolution 


that we developed during the past millenium 
(which, after all, is the capacity of paramount 


et 
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importance in vision), the receptors must be 
very small. This in turn interferes with the 
stable spectral sensitivity of receptors contain- 
ing a given kind of photopigment. How do 
we solve this problem? [a] By making color 
discrimination depend upon the stimulation of 
many receptors; [b] by dissociating color vis- 
ion to a large extent from the spatial acuity 
task; [c] by creating three or more classes of 
photopic receptors, with each member of a 
class containing exactly the same pigment or 
ratio of pigments, and then by causing the 
initial coding of color information to be in 
terms of the relative activation of these vari- 
ous receptor types. 
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DISCUSSION: 
COMPETING THEORIES OF RECEPTOR EXCITATION 
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The opinion is expressed that the classical theory that the color receptors 
of human vision contain rhodopsin-related photopigments must be aban- 
doned in part, in view of evidence presented by Enoch and McConnell. 
Additional evidence against the classical theory is presented, derived from 
luminosity data on patients with retinal abnormalities which resulted in 
loss of all but 1 color system. The view is expressed that the color 


receptors are each incapable of differentiated response to equal amounts 
of absorbed energy of different wavelengths. 


What seems to me the most important 
purpose of the present symposium has 
been well served by the presentations 
prepared by Enoch and McConnell. 
Enoch and McConnell have presented 
what must be conceded are unconven- 
tional theories of receptor excitation in 
the retina. It may be said that both 
have presented alternatives to the clas- 
sical theory that the action spectra of 
the color receptors may be understood as 
absorption spectra of rhodopsin-related 
photopigments. They have at least de- 
scribed complications or difficulties with 
the classical theory. Enoch has pre- 
sented evidence that the optical proper- 
ties of receptors influence their response 
to different wavelengths, and McConnell 
has presented evidence that the Wald 
model of excitation by breakdown of 
rhodopsin is inadequate. Both have per- 
haps gone unnecessarily far in suggest- 
ing that a single receptor may be capable 
of giving differentiated responses to dif- 
ferent absorbed wavelengths other than 
those due to differences in the absorbed 
quantity of energy. Boynton has desig- 
nated the notion that a single receptor 
cannot respond differentially to wave- 
length as the trichromatic scheme and 
has summarized well the convincing evi- 
dence in support of this assumption. 
Let me present one additional line of 
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evidence which, I believe, limits the 
range of possible theories of retinal re- 
ceptor excitation. Blackwell and Black- 
well (1959, 1961) have shown that pa- 
tients with diseases and developmental 
abnormalities of the retina reveal the 
fundamental action spectra of the three 
classes of receptors required for the tri- 
chromatic scheme, since aberrant retinas 
seem to exhibit simplified color systems 
consistent with the view that one or 
more of the color receptors has been 
rendered inoperative. Most dramatic are 
the patients classified as blue monocone 
monochromats, who have spectral re- 
sponsitivity only in the region usually 
identified with the blue-sensitive color 
receptor. 

Assuming as we do that in this case 
we may equate the luminosity curve to 
the action spectrum of the blue color 
receptor, we may test the hypothesis 
that the cone photopigment is rhodopsin 
related. Figure 1 shows the expected 
rhodopsin-related photopigment derived 
by the method of Dartnall (1957) with 
the action spectrum from the patients’ 
luminosity curve, corrected for spectral 
absorption of the ocular media and re- 
computed in terms of an equal quantal 
spectrum, presented for comparison. The 
action spectrum is clearly more narrow 
than is to be expected on the basis of 
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Fic. 1. Comparison of inferred photopigment 
absorption for the pblue-sensitive cone mecha- 
nism (solid curve) with the computed absorp- 
tion curve for the rhodopsin-related photo- 
pigment with a 430 millimicrons maximum 
(dashed curve). 


a rhodopsin-related pigment. We have 
obtained strong suggestions that the ac- 
tion spectra of red- and green-sensitive 
receptors are also more narrow than is 
to be expected for rhodopsin-related 


photopigments. : 
These results can mean either that 
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Enoch’s waveguides produce a narrow- 
ing of spectral cutoff for color receptors 
or that McConnell is correct that the 
spectral responsivity of the color recep- 
tors is not at all determined by rho- 
dopsin-related photopigments, or both. 
Yet, these data provide no difficulty 
with regard to the trichromatic scheme. 
My summary of all that has been said 
is simply that the trichromatic scheme is 
probably essentially correct, but that we 
cannot as yet identify either the precise 
action spectra of the color receptors nor 
their physical or chemical determinants. 
However, Enoch and McConnell do pre- 
sent convincing evidence that the classi- 
cal view that the color receptors contain 
rhodopsin-related photopigments whose 
action spectra determine color response 
cannot be maintained without modi- 


fication. 
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The frequently assumed relationship between factor analysis results and 
primary dimensions of objects is seriously questioned. Demonstration 
analyses are contrived to show that simple structure factors may be quite 
complex with regard to the conceptually primary dimensions of objects 
from which measurements were obtained and to indicate that there 
is no necessary correspondence between the number of factors obtained 
and the number of primary dimensions of the objects. Recommenda- 
tions are made for a more realistic view of this valuable methodologic 


tool. 


Over the last decade and a half, a 
number of methodologists have argued 
the cause of factor analysis as a method 
for identifying and understanding the 
structure of nature. Beginning with 
Thurstone’s (1947) box problem a se- 
ries of demonstration analyses have been 
undertaken with the objective of show- 
ing correspondence between factors and 
“real characteristics” or “primary di- 
mensions” of the things being measured 
(Barlow & Burt, 1954; Cattell & Sulli- 
van, 1962). In a recent issue of the 
Bulletin, Cattell and Dickman (1962) 
once more have attempted to demon- 
Strate a relationship between factor re- 
sults and the “incontrovertably known 
factor structure’ of the measurement 
domain. In view of the persuasive na- 
ture of these demonstration analyses, it 
seems important that a dissenting voice 
be heard. 

Factor analysis has been and will con- 
tinue to be one of the most fruitful 
methodologic approaches available for 
psychological research. It provides a 
technique for reducing a large multi- 

variate set of correlated measurements 
to a smaller number of perhaps more 
meaningful uncorrelated variates. The 
results need not and do not have in- 
herent in them any necessary relation- 
ship to “real” or “primary” character- 
istics of the objects or persons being 


measured. 
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Demonstration analyses are easily 
contrived for almost any purpose. Pre- 
vious analyses have been loaded to yield 
results corresponding to preconceived 
primary dimensions. It is just as easy 
to load examples in such a way that 
results do not correspond to what we 
conceive to be primary characteristics 
of objects. Logically, the demonstration 
analysis has its proper place as a counter 
example. A single exception will refute 
the general argument, but no small num- 
ber of demonstration analyses can prove 
its truth. The important thing to con- 
sider is that if factor analysis results 
sometimes fail to correspond to primary 
dimensions of the objects being meas- 
ured such a failure may occur when we 
attempt to use it as a method for dis- 
covering the “real structure of nature” 
in areas where that structure is unknown. 


Simple Structure and Complex 
Factor Dimensions 


Demonstration analyses of boxes, bot- 
tles, geometric figures, cups of coffee, 
and balls have unerringly resulted in 
factors whose number and nature cor- 
respond to preconceived primary dimen- 
sions of the objects being measured. 
From all of this, what else can the stu- 
dent conclude but that factor analysis 
is a methodology capable of results 
which are somehow magically and fun- 
damentally representative of “reality?” 


| 
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The inference is certainly that one can 
take any set of complex measurements 
and come up with simple structure fac- 
tors which correspond to preconceived 
primary dimensions. This is just not so. 
Rotation to simple structure can be un- 
derstood as an elaborate approach to 
cluster analysis. It identifies clusters of 
tests which measure the same things, but 
there is no assurance that these “same 
things” are simple and primary dimen- 
sions. 

Consider a “book problem.” Meas- 
urements of height, width, and thick- 
ness were obtained on each of 100 books. 
Equations defining 12 complex meas- 
ures as linear functions of the conceptu- 
ally primary dimensions of height, width, 
and thickness are presented in Table 1. 
Tt will be noted that the first nine meas- 
ures are complex combinations of the 


«three “primary” physical dimensions and 


that the last three are pure measures. 
Intercorrelations among the 12 meas- 
ures are presented in Table 2. 

A principal axes factor analysis yielded 
three factors. Orthogonal rotation pro- 
cedures were employed in an attempt to 
obtain a simple structure solution dis- 
regarding the three pure marker vari- 
ables which were included in the analy- 
sis only to aid in interpretation. The 


TABLE 1 


~- 
Wricuts DEFINING 12 CompLex Book 
VARIABLES 


oe = 


10H + 10W + 20T 


Xa = 

i= 20H + 20W + 20T 
X, = -0 7H — 0.2W + 3.0T 
Xs = 0.8H 1.2W + 40T 
x; = 17H — 10W + 55T 
X: = sH 13W — 03 
Xa = 15H — 13W — 0.3T 
% = He 1.5W + 00T 
Xo = 10H + 00W + 0.0T 
Xn= 00H + 1.0W + 00T 
Xv = 00H + o.0oW + 1.0T 


Note. -H = height; W = width; T = thickness. 


loadings of the marker variables on the 
rotated factor axes were then calcu- 
lated. Loadings of all 12 variables on 
the three unrotated and rotated orthogo- 
nal factors are presented in Table 3, 
and the transformation matrix used in 
obtaining the rotated loadings is pre- 
sented in Table 4. Those who favor 
oblique rotations will note that, while 
book dimensions of height and width are 
substantially correlated, the complex 
measurements in this example were 
selected in such a way that orthogo- 
nal axes provide adequate representa- 
tion of the test vector configuration. 
Examination of the projections of the 
pure marker variables (10, 11, and 12) 
on the rotated orthogonal axes leads to 
the conclusion that, while the pattern of 
factor loadings reasonably approaches 
simple structure for the complex vari- 
ables, these simple structure factors are 
highly complex with regard to primary 
dimensions of height (Variable 10), 
width (Variable 11), and thickness 
(Variable 12). Rotated Factor A cor- 
relates positively with all three con- 
ceptually primary dimensions. Variables 
1, 2, and 3 have high and rather pure 
loadings on Factor A. Examination of 
the structure of these complex variables 
as shown in Table 1 indicates that vari- 
ables with high loadings on Factor A 
are complex linear combinations of all 
three conceptually primary dimensions. 
Examination of the loadings of the 
pure marker variables on rotated Fac- 
tor B indicates that height (Variable 
10) and width (Variable 11) have ap- 
proximately equal negative projections 
while thickness (Variable 12) has high 
positive projection on Factor B. Vari- 
ables 4, 5, and 6 have high loadings 0" 
this factor. From Table 1 it can be ob- 
served that these variables represent 
weighted contrasts between height and 
width, on the one hand, and thickness 
on the other. Once again, the factor 
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TABLE 2 
INTERCORRELATIONS AMONG 12 COMPLEX BOOK VARIABLES 
af: 2 3 4 5 7 8 9 10 11 12 
i 1.00 98 97 23 —03 —O1 14 —.11 —.16 82 80 61 
2 1.00 -99 05 —.21 —.19 22 —05 —.15 91 86 46 
3 100 —.01 ~—.27  ~—.25 -18 —.10 —.22 91 90 40 
4 1.00 96 97 —22 —.15 12 —.29 —32 91 
5 1.00 99 —14 —.01 28 —48 —.57 77 
6 100 —.23 —.10 19 —49 —.53 .79 
7 1.00 96 87 51 —.07 —.06 
8 1.00 96 235 —34 —Al 
9 1.00 07 —.52 09 
10 1.00 82 13 
11 100 0% 
12 1.00 


appears to represent a discrete set of 
variables which measure the same things 
in approximately the same way, but it is 
definitely complex with regard to the 
three conceptually primary book dimen- 
sions. 

Factor C apparently represents a 
weighted contrast of height and width. 
Examination of the pattern of factor 


idth dimensions, Thus, Fac- 
tor C also appears to be complex with 


regard to the conceptually primary book 
dimensions. 

What has objective factor analysis 
and rotation to an approximate simple 
structure achieved? It has identified 
groups of variables which correlate 
highly among themselves and which are 
relatively independent of other variables. 
The results depend entirely upon the 
pattern of intercorrelations among the 
variables, and these intercorrelations in 
turn depend, not upon variables which 
correlate highly measuring the same uni- 
tary primary dimensions, but upon rela- 
tive similarities in their representations 


TABLE 3 
LoADINGS oF 12 Boox Measures on THREE ORTHOGONAL FACTORS 

Unrotated h Rotated 

Io Ib Ul A B e 
1 a2 u 59 060 1.000 988 101 —.065 
n nd 098 1.000 993 —.079 —.012 
3 a : oe 035 1.000 983 —.145 —.070 
4 Sa 934 031 1.000 120 995 —.062 
5 —.587 798 133 999 —.133 994 071 
6 — 568 822 044 1.000 —.115 998 — 021 
7 29° 179 959 1.000 222 187 ‘058 
8 —062  —.215 975 1.000 —039  _ 084 '996 
9 =n DEA —.002 965 1.000 —.120 .200 pee 
10 944 042 328 1.000 884 __ 389 288 
11 959 037 —.280 999 8240 agg z 5 
p ‘057 992 112 1.00 519 ‘84 L a 
= : ee ean 
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TABLE 4 


TRANSFORMATION MATRIX FOR ORTHOGONAL 
ROTATION OF THREE Boox FACTORS 


876 — A478 — .066 
459 .880 —.124 
117 078 -990 


of the underlying primary dimensions. 
Variables may correlate highly which 
tend to reflect in about the same degree 
several different primary dimensions of 
the things being measured. In order for 


the simple structure criterion to result ` 


in factors which represent primary char- 
acteristics of the things being measured, 
it is necessary that only a minority of 
the variables which reflect one primary 
characteristic to a high degree also re- 
flect any other one primary characteris- 
tic to a high degree. When we do not 
know beforehand what the primary di- 
mensions are, it is difficult to justify the 
belief that factor analysis will somehow 
magically point them out to us. 


Number of Factors and Primary 
Dimensions of Nature 


It may be that we cannot count on 
factor analysis to result in factors which 
correspond directly to the “real” dimen- 
sions of the thing being measured. Are 
we still justified in assuming that there 
is some sort of fundamental relationship 
between the number of factors and the 
number of primary conceptual dimen- 
sions (whatever those are) of the things 
being measured? This problem of the 
number of factors is naturally very im- 
portant to those who view factor analy- 
sis as methodology fundamentally re- 
lated to the structure of nature. 

Cattell and Dickman use a theorem 
derived by Guttman (1954) which says 
that the lower bound for the number of 
factors is equal to the number of latent 
roots of size 1.00 or larger. In the fore- 
going example, three and only three roots 
exceeded 1.00. Thus, in at least one 


book problem we have come up with the 
expected number of factors. 

Consider Book Problem Number 2. 
Twelve complex measures defined as 
functions of the conceptually primary 
dimensions of height, width, and thick- 
ness are presented in Table 5. These 
12 complex measurements were obtained 
for each of 100 books. Intercorrelations 
among the 12 complex measures are 
presented in Table 6. 

A principal axes analysis was under- 
taken with unity in the principal di- 
agonal. Four factors with corresponding 
characteristic roots greater than unity 
were extracted. The four factors ac- 
counted for 98% of total variance. The 
roots associated with the four factors 
were 5.36, 4.29, 1.09, and 1.02, respec- 
tively. The loadings of the 12 measures 
on the four unrotated principal axes are 
presented in Table 7. Since interest here 
is in the number of factors, no attempt 
has been made to rotate the axes, The 
point is that in two different analyses 
of complex measurements of the same 
physical objects different numbers of 
factors were found, Thus, it appears 
that neither the number nor the nature 
of primary factors has inherent relation- 
ship to the true conceptual dimensions 
of the objects being measured. 

It should be recognized that the 


TABLE 5 


TweLve COMPLEX BOOK VARIABLES AS FUNC- 
TIONS OF HEIGHT, WIDTH, AND THICKNESS 


X: =H/H:W-T 

Xa = H — 0.2W — 0.2T 
X: =H / (0.2W + 0.2T) 
X. =W/H'W'T 
=W — 02H — 0.2T 
X. = W / (0.2H + 0.2T) 
X, =T/H:W-T 

X. =T — 0.05H 

X, =T/005H 

Xo =H-W:T 

Xu =H+W+4+T 
Xa = (H° + W° + 4.0T*)} exp (2) 
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TABLE 6 
INTERCORRELATIONS AMONG 12 COMPLEX BOOK VARIABLES 
1 2 3 4 5 6 7 8 9 10 ll 12 
1 1.00 -06 50 99 05 -28 02 —.59 -60 —45 —.21 —.14 
2 1.00 02 .09 -62 26 —87 —09 —.25 29 -80 82 
3) 1.00 44 —46 —.24 52 —69 —68 —.68 —.57 —.51 
4 1.00 13 35 03 58 60 43 15 08 
5 1.00 86 —.84 —15 —.25 24 81 82 
6 100 —53 —48 —.53 —17 39 42 
7 1.00 00 15 —38 —90 —.91 
8 1.00 98 86 33 27 
9 10000 750 192 
19 100 68 65 
11 100 99 
v2 1.00 


emergence of more than three factors in 
this example depends upon the inclu- 
sion of complex variables which are non- 
linear combinations of height, width, 
and thickness. The factor model as- 
Sumes that each complex measure is a 
simple linear combination of primary 
factor variates, but we have no control 
Over nor insight into the nature of com- 
Plexities involved in most practical prob- 
lems. Even Thurstone (1947) included 
some nonlinear combinations in his 
famous box Problem. In domains where 
we do not artificially construct our com- 
plex variables, nonlinear complexities are 


TABLE 7 
PRINCIPAL Axis Factors FROM 12 COMPLEX 
VARIABLES 
I II 
1 —39 —.70 ıs 58 
2 66 50 —51 zn = 
3 5.72 -39 ~49 _ 94 ion 
4 = 88 = 92 ae 59 100 
5 m = 58 31 — 91 99 
6 33 —.72 59 ga 99 
7 —8 48 14 11 94 
s 49 2 10 a7 1.00 
9 36 88 —.01 27 99 
10 77 48 —.19 -26 93 
11 97 —20 -mH 02 1.00 
12 95 —.26 —.14 -04 99 


| 


almost certain to occur and these will 
influence the number and nature of fac- 
tors obtained. For example, volume may 
be a meaningful complex book dimen- 
sion with three additional independent 
dimensions of height, width, and thick- 
ness also present. Cattell and Dickman 
(1962) include variables such as weight. 
For equal density objects, weight is 
directly proportional to volume which 
in turn is a nonlinear combination of 
height, width, and thickness dimensions. 


Further Comments concerning the 
Number of Factors 


Humphreys (1962) has described in 
precise terms the problem of multiple 
levels of factor results. He suggests that 
it is useful to conceive of at least three 
levels of factor generality, all of which 
can be represented under an appropri- 
ate orthogonal transformation, His dis- 
cussion appears to bring to formal fac- 
tor theory the concepts of hierarchical 
analysis presented by McQuitty (1960a, 
1960b) in somewhat different context. 
Guttman (1958) has also departed from 
the traditional school in Suggesting that 
it is most reasonable to Conceive of test 
Scores as being determined by a large 
number of sources of variance, 


The basic factor model represents the 


— Sana 
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common-part score, c, as a linear func- 
tion of r independent sources of vari- 
ance which we call factors: 


+ fry; 


c=fiy: + fye +... 


The vector of common-part scores on p 
tests for a single individual can be rep- 
resented in matrix notation in terms of 
p different linear combinations of the r 
factor variates. 


c F y 


pxl° pxrrxl 


Multiplying both sides by c’ = y’F’ and 
taking expectation yields the familiar 
equation 
R _ FEF P 
px Pp” pry kp’ 

where R is the matrix of covariances 
among the common-part scores (reduced 
correlation matrix) and F is the matrix 
of factor loadings. 

There is nothing in the basic factor 
model nor in the fundamental factor 
equation which requires that r be a 
small number. Suppose that we begin 
by writing equations for 30 variables in 
1,000 terms each, 


a =fu yi + fio Ye +... £12000 Y1000 


Ce = for yr + fee ye + 


. + + £2,1000 Y1000 


Cao = £3011 + fa0o2¥2 + -+ f50,1000¥ 1000 
Suppose that these equations do in fact 
represent the projections of 30 tests upon 
1,000 factors which are independent 
sources of variance in the population. 
We could from these equations generate 
a 30X30 reduced correlation matrix R. 


R . F  F 
30 x 30 =30 x 1000 ` 1000 x 30 


If we turn around and apply the 
methods of factor analysis to the re- 
sulting matrix which is based upon vari- 
ances and covariances derived from 


1,000 independent factors, we obviously 
would obtain fewer than 30 common 


factors. Thus, although it is true that 
we can factor R to obtain F and that 
we can then reproduce R from F, in gen- 
eral there is no reason to expect that 
we can begin by producing R from F 
and then turn around and obtain this 
same F by factoring R. It seems to the 
present writer that the fundamental fac- 
tor equation has led to a fundamental 
factor fallacy where this reverse logic has 
been applied. 

It is reasonable to conceive of every 
word in the language as representing an 
independent source of variance in the 
responses we observe. Change a word 
and the observed response changes. 
Thus, there are potentially many sources 
of variance in which we might be in- 
terested. Certainly, if we have 30 com- 
plex tests, each according to this view 
representing a large number of molecu- 
lar variance components, we might con- 
sider each measure to represent a 
meaningful and important independent 
composite source of variance. The 30 
scores might be factored in a meaningful 
way into 30 orthogonal variance com- 
ponents, into 10 orthogonal variance 
components, or into 2 or 3. It is con- 
ceivable that each of these analyses 
could prove useful and meaningful. Who 
is to say that this model is “correct” or 
that number of factors is the “true” 
number? If there are any true dimen- 
sions of things, we might do well to con- 
sider them conceptual and not inherently 
real. Multiple sets of concepts may span 
the same domain and provide useful al- 
ternative representation of that domain. 


Interpretation of Factors 


Does all of this mean that factor 
analysis is a false and worthless meth- 
odology? Certainly not. It promises to 
be one of the most useful research tools 
in social science and will realize that 


i i trip it of 
romise more surely if we s 
sgt and embellishments. Factor 
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analysis is a procedure for obtaining 
from a large set of correlated measure- 
ments a relatively small set of uncorre- 
lated, psychologically meaningful linear 
components which account for most of 
the variance in the larger set. The three 
most usual objectives are parsimony, 
orthogonality, and meaningfulness. De- 
pending upon the relative emphasis 
placed upon these different objectives, 
different methods of analysis and rota- 
tion should be considered. 

Simple structure is also a valuable 
concept in factor methodology. The ar- 
bitrary reference frame may be rotated 
for the purpose of displaying the major 
characteristics of the test configuration. 
Simple structure factors identify clus- 
ters of tests which correlate highly and 
which are relatively independent of other 
tests, Frequently we can, through ex- 
amination of the tests, associate with 
the factor a meaningful pre-existing con- 
cept, Sometimes a new concept may need 
to be formed to represent the associa- 
tion of items. In all of this, there is no 
need to assume that simple structure 
factors will correspond to any particu- 
lar set of fundamental dimensions of the 
objects nor, for that matter, to assume 
that such fundamental dimensions exist 
1n nature, 

Considering the results of the first 
book analysis, we see that meaningful 
concepts can be associated with the 
simple structure factors in spite of the 
fact that these concepts do not corre- 
spond to what we might consider to be 
primary physical dimensions of a book, 
Factor A is a composite of all three 
physical dimensions. It Corresponds to 

our psychological conception of the size 
of the book. An increase in any 
dimension increases the size of the 
book. Factor B represents thickness 


relative to page size. We might 
formulate a concept of “obesity” in 
books; that is, thickness relative to 
frame. Factor C represents “departure 
from squareness” in cover dimensons. 
While we may not be as familiar with 
these concepts and may not be in the 
habit of perceiving books in terms of 
size, obesity, and squareness, these inde- 
pendent dimensions appear quite mean- 
ingful. In spite of the meaningful fac- 
tors obtained, the writer would not 
encourage one to argue that we should 
stop considering books in terms of 
height, width, and thickness because fac- 
tor analysis has shown us that size, 
obesity, and squareness are the real 
primary dimensions, Alternative sets of 
concepts may prove quite useful. 
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PSYCHOPHYSIOLOGICAL INTERRELATIONS IN 
THE SOCIAL BEHAVIOR OF CHICKENS* 


A. M. GUHL 
Kansas State University 


Chickens establish a social order based on domination and subordination 
by pecking. The interindividual habits are relatively fixed as social 
inertia and channel various activities in an order of precedence. In- 
creased aggressiveness produced by treatment with an androgen is sup- 
pressed by these habits but becomes evident when an injected bird is 
matched with a total stranger or under a highly competitive situation. 
Successful revolts within the peck order are related to the degree of 
extinction of domination or toleration between individuals. Chicks 
treated with gonadal hormones show similar suppression of altered 
aggressiveness by social inertia. The passive dominance of males over 
females facilitates sexual behavior. Low ranking males have their sexual 
activity inhibited, whereas low rank among hens augments sexual recep- 
tivity. Physiological adaptation to social stress at low levels in the 
dominance order is brought about by the stimulation of the pituitary- 


adrenal cortex axis as shown by the hypertrophy of the adrenals. 


In the course of over 20 years of re- 
search in the social behavior of chickens 
there have been recurring evidences 
that the habits of domination and sub- 
ordination have influenced activities 
regardless of the physiological condi- 
tions of the individual birds. These 
behavioral phenomena have not received 
the attention that they appear to merit. 
Beach (1958) has made the comment 
that psychologists and endocrinologists 
have worked independently, with the 
result that the interrelations of their 
disciplines have remained a matter of 
speculation. It is the aim of this re- 
view to bring some evidence from these 
two fields for further consideration. 

The writer, a zoologist, has viewed 
behavior as the actions and reactions 
of the whole organism and as physi- 
ology outwardly expressed. Although 
conditioning, learning, and memory have 
a physiological basis, they are not well 
understood as physiological phenomena 
and therefore the term psychological 
has a useful connotation within the con- 
text of this review. 

1 Contribution No. 321, Department of Zool- 


ogy, Kansas Agricultural Experiment Station, 
Kansas State University, Manhattan. 


The observations and experiments 
cited herein have considered the follow- 
ing factors, among others, which may 
alter behavior: physical factors such 
as extremes of temperature; changes in 
the thresholds of response caused by 
fatigue and drive reduction following 
repeated elicitation, and from stimula- 
tion that results in what ethologists call 
“overflow activities”; conditioning and 
learning; maturation and aging, which 
include experience; seasonal factors, 
such as internal rhythms and endocrine 
effects due to changing day lengths; 
genetic basis for strain differences and 
individuality; and dietary deficiencies 
or avitaminosis. 

Social behavior has been defined vari- 
ously although with somewhat similar 
denotations (Guhl, 1962b). According 
to Carpenter (1942) “social behavior 
refers to the reciprocal interactions of 
two or more animals and the resulting 
modification of the individual’s action 
systems [p. 248].” Others prefer cer- 
tain limitations, such as occur within 
the same species. The type of social 
organization within a given species 15 
determined by the behavior patterns 


characteristic of that species. Since the 
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phenomena discussed here pertain to the 
common domestic fowl a brief descrip- 
tion of the social hierarchy, or peck 
order, is required. 


THE SOCIAL ORGANIZATION 


Agonistic behavior, which includes at- 
tack and escape reactions, forms the 
basis for the social order in chickens 
(Guhl, 1962a). When individuals are 
marked for identification, one can record 
all social encounters involving the one 
pecking and the one pecked. When 
these observations are tabulated it be- 
comes apparent that such domination 
is unidirectional; that is, peck rights 
are established, and that all the birds 
may be ranked according to the number 
of hens each pecks without any retalia- 
tion. This ranking has been called a 
dominance order, or peck order. In very 
small flocks the order may be a straight 
line hierarchy; however, pecking tri- 
angles are common and give the order 
of precedence a geometrical structure. 

Repeated pecking reinforces the domi- 
nance relations between all pair com- 
bination in the flock, Thus special habits 
are established between the individuals. 
In time symbolic threats, or the mere 

raising or lowering of the head may 

suffice for the maintenance of pair- 
reaction patterns. This reduction in 
frequency of pecking or threatening in 
time is shown in Figure 1 (intensity 
of agonistic behavior is not illustrated 
Siegel & Hurst, 1962). There are two 
points of particular interest; one, that 
the dominance order channels the flow 
of activity by way of precedence in com- 
petitive situations and thus precludes 
fighting and conserves energy; the other 
concerns the gradient of habits (or at- 
titudes) from that of a high level of 
domination to a characteristic response 
of subordination at lowest levels. Social 
mobility (changes in rank) is unusual 


and the relative fixity of response pat- 
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NUMBER ENCOUNTERS 
gS š 


30 O 90 170 10 180 210 240 270 300 330 380 
DAYS 


Fic. 1. The decrease in domination after 
assembly of flocks of hens as shown by total 
social encounters as a function of time (data 
from Siegel & Hurst, 1962). 


terns results in social inertia within the 
flock. The mutual interindividual ad- 
justments promote toleration which in 
turn facilitates the activities within the 
group (Guhl & Allee, 1944), It is the 
relation of social habits to the physiology 
of behavior which will be discussed, 


GonapaL HORMONES AND SOCIAL 
INERTIA 


Androgenic Treatment. Since the 
early studies of the endocrines suggested 
a relationship between androgen and 
aggressiveness, Allee, Collias, and Lu- 
therman (1939) injected testosterone 
Propionate into low ranking hens within 
several small flocks to investigate its 
effects. Several of the low ranking, 
treated birds staged successful revolts 
and rose in rank, even to top level. 
However, treated hens in one flock 
failed to show any mobility in the social 
order, for which no explanation could 
be given. Nevertheless, the significance 
of social inertia was recognized by the 
statement that “memory and habit re- 
inforce, and may entirely replace the 
aggressive behavior which is so impor- 
tant in the origin of a social order 
{p. 436].” During the postinjection 
period the hens that improved their 
rank tended to retain their new social 
Status. This was attributed to psycho- 
logical factors which retarded the rise 
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but now helped to maintain the newly 
acquired rank. Similar results were ob- 
tained by Allee and Foreman (1955). 

During the course of this experiment 
the treated birds were tested for ag- 
gressiveness in initial pair encounters. 
This test is similar to that used by 
Maslow (1934a, 1934b). Briefly, when 
two unacquainted hens are placed into 
a neutral area they readily establish 
dominance relations, and relative ag- 
gressiveness is indicated by the number 
of contests won. The treated hens were 
matched in pairs with normal controls 
and won most of these encounters, thus 
indicating that androgen may enhance 
aggressiveness in the absence of social 
inertia, 

Subsequently some of these birds were 
used in an experiment on social dis- 
crimination, One hen which received a 
pellet of testosterone propionate showed 
no social mobility, although she threat- 
ened two males when tested in a dis- 
crimination cage (Guhl, 1942, p. 145). 
When she was placed into a pen of 
strange hens she drove them about the 
pen (the opposite of what was expected), 
put on return to her own flock she re- 
sumed the behavior associated with her 
rank (Guhl, Collias, & Allee, 1945, 
p. 380). 

With a flock consisting of 16 indi- 
viduals, 4 each of 4 breeds, Williams 
and McGibbon (1956) implanted a pel- 
let of testosterone into 1 of each breed 
and at several levels in the social order. 
Observations over a 6-week period failed 
to show any social mobility. It was 
concluded that the hormonal effects 
were too gradual to overcome the al- 
ready established behavior patterns 
(social inertia). 

These results all suggest that recog- 
nition, memory, and habits underlie the 
social inertia that is responsible for a 
stable social order. Ill or injured birds 
may maintain their status provided that 


factors for recognition (e.g., combs) do 
not change abruptly. Successful revolts 
may occur among normal hens, but 
rarely in well-managed small flocks. 
To resolve these apparent discrepancies 
a study of the conditions under which 
revolts may be successful or unsuccess- 
ful was indicated. 

Recent experiments (Guhl, unpub- 
lished) have used several flocks of 10 
females. Peck orders were allowed to 
develop to stability and testosterone 
propionate was then injected daily into 
certain individuals. The hens to be 
treated were selected according to the 
frequency at which they pecked or were 
pecked. Competitive interactions were 
stimulated during timed observation 
periods by presentation of scratch grain 
and wet mash. Relative social stress was 
measured in rates of pecking and threat- 
ening, which could be used to deter- 
mine differences in social tension be- 
tween flocks as well as between 
individuals within flocks. In com- 
petitive situations the injected bird 
would readily assume a threatening 
stance, with varying intensity, which in 
turn would evoke an attack by its rank- 
ing penmates. If the social tension was 
high, then the revolt was unsuccessful, 
whereas there was a greater chance of 
success when toleration (tendency 
toward extinction) was noteworthy. For 
example, in one set of experiments in- 
volving four flocks during several 
months of observations, there were 292 
observed unsuccessful revolts of which 
215 were initiated by treated hens. Of 
only 20 successful revolts, 16 were by 
treated birds. , 

It may be concluded that social inertia 
tends to stabilize the social order irre- 
spective of changes in relative aggtes- 
siveness produced by treatment a 
androgen. Revolts may be successful i 
the subordinate indivi 


dual is stimulated 
sufficiently in 4 competitive situation 
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and the existing social relations are 
marked by toleration. 

Estrogenic Treatment. The effects of 
an estrogen on social mobility have re- 
ceived little attention. Allee and Collias 
(1940) injected estradiol into several 
hens in five different flocks. Only two 
shifts in dominance relations were ob- 
tained. This was in contrast with the 
higher upward mobility which resulted 
when treated with androgen, and con- 
trariwise the status displacement was 
downward with estrogenic treatment, 
Williams and McGibbon (1956) failed 
to obtain any social mobility by the 
implantation of pellets of diethylstilbes- 
trol. The analysis of recent experiments 
with estrogen (Guhl, unpublished) have 
not been complete. No social mobility 
was recorded. Treated hens showed 
little change in frequency of pecking 
although Some treated birds appeared to 
be less intense in their domination. This 
Modification of behavior promoted tol- 
eration of subordinate penmates rather 
than the evocative threatening stance 
shown after injection with an androgen, 


DEVELOPMENT OF SOCIAL ORGANIZATION 
IN CHICKS 


the developme 
the resultant peck order in chicks 
(Guhl, 1958). The age, in weeks, at 
which each chick established definite 
dominance relations was determined, 
Normal males began to set up a peck 
order at 5-7 weeks of age and com- 
pleted the dominance order at about 
8-10 weeks. Females initiated their 
social positions about 1 week later and 
required more time to complete the or- 
ganization. Chicks kept in isolation 
from hatching onward were assembled 
when their penned controls formed a 
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peck order and were able to establish 
dominance relations in a matter of hours. 
These results immediately raised the 
question of the relative effects of 
maturation and/or learning. 

Unisexual groups of males and females 
were set up as controls and experimen- 
tals from the same hatch with all the 
chicks in the treated group receiving a 
gonadal hormone from the second or 
third day after hatching. Groups of 
each sex which received an androgen 
formed their social dominance earlier 
and in only slightly less time (about 
1 week) than their respective controls. 
Males castrated (capons) at 10 days 
of age were slow in eliciting aggressive 
behavior and formed an order 4 weeks 
later than their controls. But capons 
treated with an androgen established 
pair reactions and their peck order 
earlier than the normal controls, Con- 
trary to expectancy, treatment with an 
estrogen resulted in both male and 
female groups showing a similar ad- 
vanced development of social organiza- 
tion. However, there was a marked 
difference in response behavior. Estro- 
gen-treated chicks showed little pecking 
but avoided readily, with the conse- 
quence that the social system was es- 
sentially a consistent avoidance order. 
It was somewhat surprising that the hor- 
monal treatments did not induce greater 
Precocity. Could the lag have been due 
to some form of social inertia, even 
among immature birds? 

To determine whether androgen treat- 
ment could induce an earlier formation 
of peck rights in the absence of learned 
Social responses, five sets of males from 
the same hatch were used (Guhl, 1958), 
Two groups were penned from hatching, 
with one serving as a control and the 
other as an experimental control re- 
ceiving androgen, Three sets were 
Placed in spatial isolation and also 
treated with androgen. The isolates 
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were assembled at 31, 41, and 51 days 
of age, and observed for the following 
7 days to determine dominance rela- 
tions. 

As may be seen from Figure 2, the 
experimental controls developed peck 
rights earlier than the normal controls 
as expected, and formed 53 out of a 
possible 55 dominance relations by the 
end of the observations (56 days), 
whereas the uninjected chicks estab- 
lished only 36. Treated isolates as- 
sembled at 31 days of age began to 
fight immediately and nearly completed 
their peck order (42 out of 45) by the 
end of a week, or at about the time that 
pen-reared normals began to organize. 
The other two groups of isolates intro- 
duced sexual behavior as a complica- 
tion. Androgen induces sexual behavior 
as well as aggressiveness and in the ab- 
sence of penmates no conditioning could 
be made in relation to either type of 


— NORMAL CONTROLS 
=== INJECTED CONTROLS 
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504 ;——-x ASSEMBLED 41st DAY 
+—— + ASSEMBLED Sist DAY 
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TOTAL NUMBER OF PECK: 


DAYS OF AGE 


Fic. 2. The total number of peck rights es- 
tablished at various ages among males reared 
as a group and others reared in spatial isola- 
tion and assembled at different ages. (The 
isolated chicks and one group reared together 
were injected with androgen. See Guhl, 1958.) 


behavior. Those assembled at 41 days 
displayed predominately sexual behavior 
and established only 20 out of 45 peck 
rights. Chicks assembled at 51 days 
apparently made some adjustment to 
the conflicting tendencies for either at- 
tack or for sexual approaches. None of 
the treated birds in flocks showed simi- 
lar sexual behavior. These results may 
offer some evidence for the need for 
spacing, in time, the emergence of be- 
havior patterns during development to 
permit social adjustments or learning. 
Treatment with androgen tended to re- 
duce this interval and resulted in a con- 
flict of sexual and dominance drives. 
In another experiment of this series, 
an attempt was made to determine how 
early in the life of normal chicks that 
agonistic behavior might occur if there 
were no social inertia. Four sets of 10 
male chicks each were used, with one 
set in isolation and the other three in 
pens, Every other day one chick from 
each of two groups was interchanged. 
Similarly, a chick from isolation was 
placed into the third penned group as one 
was removed and placed into isolation. 
These two patterns of rotation in the 
membership of the groups were con- 
tinued for 8 weeks from hatching. The 
data in Figure 3 are from daily observa- 
tions of 15 minutes per pen, All forms 
of agonistic behavior were not only at 
a higher frequency in the pen containing 
chicks from isolation, but also started 
at an earlier age. Chicks introduced 
from isolation were a stronger stimulus 
for attack than those rotated between 
groups, and differences in their postures 
were apparent. The results showed that 
social adjustments may be made early 
in the life of chicks, even without ex- 


ogenous androgen. 
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considered separately, 

Social Status of Males. To determine 
the effects of social rank among males 
on their sexual activity they were tested 
singly, and in rotation, with a flock of 
hens to obtain an estimate of their 
sexual drive (Guhl, Collias, & Allee, 
1945). All of the males in each of two 
flocks were then permitted to remain 
together with the same hens used for 
testing. A summary of some of the data 


a group and those rotated from group to 
8.) 


group. (The latter show social inertia. 


is given in Table 1. Although differ- 
ences in sexual drive between some of 
the males are suggested in the summary 
chart, there is also an indication that 
social rank influenced performance when 
in competition for mates, Of particular 
interest at this point is the case of 
Male III who was psychologically 
castrated in the presence of his social 
superiors. He also failed to tread the 
hens when all three of his ranking males 
were removed temporarily, but did so 
readily when placed into a pen with 


males, whereas a ratio of 1 
competitive among Legh 
when a more Teasonabl 
used (Guhl & Warren 


Orns, However, 
€ sex ratio was 
> 1946) similar 
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TABLE 1 


Mean Rates PER Hour OF SEXUAL ACTIVITY 
or Four Mares IN Eacm or Two FLOCKS 
TESTED SINGLY WITH THE HENS AND WHEN 
ALL Four MALES WERE IN THE PEN 


Pach over i Courting Treading 
Singly 4males Singly 4 males 
I 8.2 8.8 3.1 3.2 
II 8.2 7.1 4.6 1.3 
IV 7.2 64 2.5 2.5 
II 8.7 2.5 3.3 0.0 
v 24.3 11.3 9.5 1.6 
VI 42.1 9.6 8.4 1.4 
VII 21.1 3.2 7.1 0.2 
vill 413 10.0 9.2 1.1 


Note.—From Guhl, Collias, and Allee (1945). 


results were obtained. This extension of 
the study showed that the highest rank- 
ing male may also sire the most chicks 
in this polygamous species. 

It is of further interest that the 
passive dominance of males over 
females is associated with normal mat- 
ing. Young cockerels will fail to mate 
with older hens and are dominated by 
them. Since estrogen may induce sex- 
ual behavior in capons without any 
increased aggressiveness Over the cas- 
trate condition, such males were used 
to determine the importance of hetero- 
sexual dominance (Guhl, 1949). Treated 
capons mated quite normally with 
females over which they were able to 
rank but were driven by those that 
held higher status in the heterosexual 
peck order formed prior to treatment. 
Thus social status may facilitate or 
inhibit sexual performance. Further- 
more, it has been shown that the rela- 
tive acquaintance between the sexes 
influences the frequency and type of 
sexual behavior patterns displayed by 
both sexes (Guhl, 1961). 

Social Status of Females. Negative 
correlations have been found between 
the social rank of hens and the frequen- 


cies of submitting to a male and of being 
mated (Guhl, Collias, & Allee, 1945). 
Hens may crouch in a submissive pos- 
ture when strongly dominated by cer- 
tain of their superior penmates, and 
pseudomatings between hens have been 
reported under such circumstances 
(Guhl, 1948). The sexual crouch dis- 
played to a male is essentially similar 
but may be of a higher intensity. From 
such observations one might assume 
hens that rank low and are in the habit 
of submitting may display the receptive 
crouch more readily than high ranking 
females that are more inclined to domi- 
nation. Such a hypothesis can be tested 
by altering the relative intensities of 
these habits. 

The peck orders of three different 
flocks of approximately 30 hens each 
were determined (Guhl, 1950). Three 
or more males were caged within the pen 
and released singly for short daily 
periods. The number of crouchings 
observed for the hens in the top, middle, 
and bottom thirds of the social order 
was recorded. After several weeks of 
observation each flock was divided 
(subflocked) according to three levels 
in the peck order. This procedure did 
not alter the peck rights between the 
hens remaining in contact. Levels of 
domination and subordination were 
altered because the top third (as a 
group) now had two thirds fewer birds 
to dominate, the bottom birds had two 
thirds fewer dominating them, whereas 
the habits of the middle third were 
altered about equally between dominat- 
ing and being dominated. The same 
males were then rotated through each 
of these respective subflocks. Subflock- 
ing occurred when the laying cycle was 
at its peak as an attempt to balance the 
endocrine relationships. - 

The results for one flock are indi- 
cated in Figure 4. Under the conditions 
of subflocking a total reduction in rates 
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Fic. 4. The effects of habits of domination 
and submission on sexual receptivity as shown 
by subflocking that changed the intensity of 
domination in highest ranking hens and of sub- 
mission by hens in lowest social level. (After 
subflocking by three levels in the peck order 
the dominating females from the top third 
became more receptive. See Guhl, 1950.) 


of crouching was expected because only 
one male was used per small flock per 
day, whereas a sequence of three or 
more males was used in the large flock. 
In this manner the sex ratio remained 


t ) minating in females 

interferes with mating. 

SOCIAL STRESS ANp THE 
ENDOCRINE SYSTEM 


Although the peck order is an adjust- 
ment to the presence of penmates, indi- 
viduals at the lowest levels of the social 
order may be under stressors related to 
their rank, especially if situations 
become highly competitive. Since these 
birds are usually confined and under 


high population density for economic 
reasons, the effects of social stress are 
now receiving some attention. According 
to Selye (1947) animals may adapt to 
various forms of stress by the increased 
activity of the pituitary-adrenal cortex 
axis, which becomes evidegt by the 
hypertrophy of the adrenals, 

It has been shown that crowding, 
that is, the number of square feet per 
bird, results in an increase in the adrenal 
weight in hens (Siegel, 1959). However, 
there was no evidence of an acute hyper- 
trophy of the cells or lipoidal depletion 
and the reaction was wéll within the 


birds’ adaptive ability. Similar results” 


were obtained with males (Siegel, 1960). 
When social stress was introduced exper- 
mentally among males (Siegel & Siegel, 
1961) all individuals responded with 
increased adrenal weight when compared 
with controls in individual cages, The 
left adrenal, which is more Sensitive than 
the right in chickens, was heavier in 
cockerels which ranked low in the social 


order. Within the limits of the stress to 


which they were subjected, the males 


remained in the adaptive phase. It was ' 


postulated that these birds may be less 


liable to Psychological stimulation than 
mammals, 
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A REVIEW OF SOME RELEVANT LITERATURE 


G. S. TUNE 1 
School of Public Health, Harvard University 


A review of literature givin 
human Ss, 


g evidence of response preferences, mainly in 
is undertaken. The preferences are taken from experimental 


work in early psychophysical data, subjective estimates of “chance” se- 
quences, response mechanisms at the threshold, “gambling” situations, 
probability learning, and the influence of instructions on performance in 
probability learning. Explanations of response preferences are reviewed 
and the relationships between subjective uncertainty and preferences are 
explored with an attempt to give cohesion to a diverse body of experi- 


mental evidence. 


Fernberger (1913, 1920, 1930) was 
probably the first investigator to stimu- 
late interest in the “contrast effect,” a 
phenomenon which showed up as long 
Sequences of nonrandom responses which 
distorted the psychophysical function 
and affected the position of the point 
of subjective equality. Turner (1931) 
Confirmed Fernberger’s finding that sub- 
Jects tended to alternate their responses 
from Tesponse category to category and 
tried to explain this as a change in the 
effective value of the stimuli. Further 
experiments showed that the phe- 
nomenon was unaffected by the use of 
stimuli of equal Physical value (Arons 
& Irwin, 1932), the use of long inter- 
stimulus intervals (Preston, 1936a 
1936b, 1936c), or the use of cross-modal 
judgment situations, In all cases the 
predominant pattern of Tesponse was an 
alternation. 

Goodfellow’s (1938) examination of 
the Zenith Radio Foundation experi- 
ments in telepathy led him to reject the 
telepathic interpretation of the results 
in favor of one in terms of subjective 
preferences for nonsymmetrical patterns 


1 My thanks are due to the Department of 
Scientific and Industrial Research of Great 
Britain which provided the financial support 
for this work and to Stephen Griew under 
whose guidance this work was undertaken at 
the University of Bristol. 
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of response. He later suggested other 
factors which could affect the response 
choice (Gault & Goodfellow, 1940) and 
Proposed that the subjective concept of 
“chance,” which was different from the 
physical notion of the concept, would 
be important because subjects operating 
in accordance with this would be un- 
able to generate a random series of re- 
sponses. Skinner’s (1942) explanation 
was somewhat similar. He described 
alternations as a “tendency” operating 
within the subjective idea of chance 
where an alternation of responses would 
diminish the tendency and a repetition 
would strengthen it. 

This early work on the “contrast ef- 
fect” was the historical root of much 
of the contemporary speculation on the 
stochastic nature of behavior. The find- 
ings had implied that the phenomenon 
was of central rather than peripheral 
origin (Irwin & Preston, 1937) and that 
long Sequences of responses were likely 
to be Statistically interdependent in a 
complex manner (Irwin & Preston, 
1937; Preston, 1936b), two implications 
which have contemporary interest. 


Subjective Estimates of “Chance” 
Sequences 
Reichenbach’s (1949) suggestion, 


that mathematically naive subjects 


———— 


RESPONSE PREFERENCES 


would be unable to produce a random 
series of selections from a limited num- 
ber of alternatives, has been explored 
experimentally. Number preferences, 
“gambling” situations, and successive 
judgments where no actual judgments 
could be made have all been used. 

Subjects with some training in mathe- 
matics and probability theory were re- 
garded as mathematically sophisticated 
and were compared with subjects with- 
out such training by Chapanis (1953). 
Subjects wrote a total of 2,250 num- 
bers between 0 and 9 inclusive, in any 
order at their own pace. Subjective 
preferences for certain numbers were 
found but there was little agreement in 
their preferences. Repetitive pairs and 
triplets were avoided by unsophisticated 
subjects, while sequences of different 
numbers were preferred. Decreasing 
combinations were highly preferred but 
increasing combinations were not. Se- 
quences of six, seven, and even eight 
numbers were sometimes repeated at 
widely separated intervals. An autocor- 
relation showed that there was an asso- 
ciation between the nth and the #+xth 
digit, which was significant when they 
were separated by as many as four inter- 
vening digits. Sophisticated subjects 
tended to be more nearly random than 
the naive group. 

A similar task was set by Lincoln and 
Alexander (1955). Eight discs were 
arranged to form either a square or a 
circle and subjects were instructed to 
produce a random selection by touch- 
ing the metal discs with a stylus or 
calling out code names printed on paper 
discs. It was revealed that subjects did 
not select a random sample because of 
preferences for certain sequences of re- 
sponses. The preferences were influ- 
enced by the shape of the figure used 
and the method of response. 

Reichenbach’s claim (1949), that un- 
sophisticated subjects would be unable 
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to generate a random series because of 
insufficient runs of the same element, 
was specifically examined by Ross 
(1955). In this, subjects tried to pro- 
duce a random series by stamping cards 
taken from a file with symbols X or O. 
The files were arranged so that the 
proportions of each symbol varied from 
condition to condition. Bakan (1960) 
challenged Ross’ claim that the response 
series were random. Bakan obtained 
runs of responses from his subjects and 
compared the two halves of each set of 
data from each subject. Triplets of 
response correlations significant at the 
1% level were found. Preferences for 
one alternative or the other (‘“heads” 
or “tails”) gave r=.74, significant at 
the 1% level. On a chance expectation 
over 300 calls, 151 runs might be an- 
ticipated; in Bakan’s experiment the 
mean number found was 176, 62 of the 
70 subjects produced over 151 runs. A 
further analysis of the triplets of re- 
sponses (e.g. HHH, HHT, HTH, etc.) 
showed that the homogeneous patterns 
(HHH and TTT) accounted for 16% 
of the total responses, whereas the al- 
ternative patterns (HHT, TTH, THH, 
HTT, HTH, and THT) made up 84%, 
which was 9% greater than would be 
expected if responses were distributed 
equally over all eight possible triplets. 
The symmetrical patterns (HTH and 
THT) made up 36% of the total and 
the asymmetrical (HHT, TTH, THH 
and HTT) 48%. Assuming that all pat- 
terns would be used equally often, these 
figures should be 25% for the former 
and 50% for the latter, indicating that 
the symmetrical patterns are over-used 


by 11%. These results supported 


Reichenbach’s claim and since the €x- 
tterns was 


the response pa 
eemed difficult to ac- 


findings, unless they 
terms of the type of 


amination of 
very thorough it sı 
count for Ross’s 
were explainable in 
statistic used. 


G. S. TUNE 
288 


The results of these experiments are 
fairly clear. Subjects did not appear able 
to give a random selection from groups 
of two or more alternatives, primarily 
because of preferences for certain pat- 
terns of response. Alternative response 
patterns and nonsymmetrical patterns 
seem to be preferred to long runs of 
homogeneous responses, 


Response Mechanisms at the Thresholds 


Senders and Sowards (1952) pre- 
sented subjects with a long series of 
weak flashes of light at a constant in- 
tensity, and a tone of constant volume, 
and required them to report when the 
two stimuli occurred together and when 
they occurred separately. Instructions 
led the subjects to believe that they 
were making discriminations, but in fact 
the stimuli were always presented to- 
gether. Using an information measure 
developed by Newman (1951), Sen- 
ders and Sowards (1952) found that 
subjects preferred long series of iden- 
tical responses, a preference which de- 
creased randomness and showed a 
tendency towards first-order redun- 
dancy. The authors Suggested that the 
repetition of a response was easier than 
alternation, but the main conclusion was 
that when the “discrimination task” was 
impossible, stimulus determination of 
responses was not Possible, and they 
seemed to become response determined, 

Similar results in favor of response 
determination were found by Tune 
(1963). His subjects were required to 
state whether the gap in a Landolt-C 
was in one or other of two positions, He 
found that further away from the gap 
the degree of nonrandomness in the re- 
sponse series, as determined by the Mc- 
Gill (1954) statistic, increased. Fur- 
ther, responses became more response 

determined in that the relationship be- 
tween the previous response and the re- 
ponse increased. Contrary to Senders 


and Sowards (1952) and Howarth and 
Bulmer (1956), however, the predomi- 
nant pattern of response was an alterna- 
tion. The tendency suggested by Skin- 
ner (1942) was clearly shown. 

The assumption made in the psycho- 
physical methods, that responses are 
statistically independent of each other, 
was called into doubt by Senders (1953), 
Senders and Sowards (1952), Wert- 
heimer (1953), and Verplanck, Collier, 
and Cotton (1952). 

The general procedure followed by 
the authors was to ascertain their sub- 
jects’ threshold by the method of limits 
and then embody this value in long 
series of stimuli. Subjects reported 
whether or not this stimulus was seen. 
Howarth and Bulmer (1956) presented 
subjects with stimuli of threshold inten- 
sity at 4-second intervals, Reports of 
“seen” or “not seen” were taken and 
the time-ordered responses were ex- 
amined. Autocorrelations showed that 
the association between a response and 
subsequent responses disappeared very 
rapidly being almost completely gone at 
a lag of three. It had been suggested by 
Oldfield (1955, 1956) that fluctuations 
in visual sensitivity could account for 
such relationships as this, but Howarth 
and Bulmer (1956) were able to dis- 
count the suggestion. It was implied by 
the rationale of the experiment con- 
cerned with Oldfield’s (1955) claim, 
that the predominating pattern of re- 
Sponse was a repetition of the same re- 
sponse in long runs, This was contrary 
to what Tune (1963) found. 

_ McGill (1957) first required his sub- 
jects to identify four tones, after which 
the four tones were presented in ran- 
dom order at a constant intensity. Diffi- 
culty in discriminating the tones was 
introduced by varying the level of back- 
ground noise. The author reported that 
where the signal to noise ratio was be- 
low the average threshold, the response 
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series became nonrandom. This effect 
was greatest where only the background 
masking noise was present; it gradually 
declined as more intense tones were 
introduced. 

These experiments showed that where 
discriminations were made on the basis 
of adequate objective evidence, then a 
random response series could result. 
However, as the discriminations became 
more difficult or stimuli could not be 
discriminated response sequences tended 
to exhibit a considerable degree of non- 
randomness. The nature of this non- 
randomness was a source of dispute, 
tendencies towards frst- and second- 
order redundancy having both been 
found. Conklin (1954) drew a useful 
distinction between a judgment “set” 
and a guessing game “set.” Assuming 
that suprathreshold responses to an 
auditory discrimination task were not 
significantly related he claimed that 
response independence was reasonable. 
He did not, however, publish details of 
the difficulty of discriminating the tones 
used in his experiment. 

In a further attempt to justify the 
proposition that responses can at least 
in part be determined by stimulus 
events, Conklin and Sampson (1956) 
carried out a weight-lifting experiment 
with weights which differed from each 
other by as much as 64 grams or were 
of equal weight. They asked subjects 
to make the customary judgments in 
such a task and found that when the 
discrimination was difficult or impossi- 
ble, then response determination was 
at a maximum. 

Wertheimer (1953, 1954, 1955) found 
reliable shifts in both visual and audi- 
tory thresholds and explained them by 
saying that the successive samplings of 
thresholds were not independent. The 
day-to-day fluctuations were greater 
than the variations within 1 hour and 
the visual and auditory thresholds were 


289 


correlated, but because of the possi- 
bility of biased sampling, Wertheimer 
avoided speculating about a “central 
sensitivity factor.” 

The experiments by Day (1956a, 
1956b, 1957) were remarkable in that 
they yielded contrary results: random 
and nonrandom responses at the thresh- 
old. In one experiment (Day, 1956a), 
subjects detected a 1 jnd in a tone pre- 
viously determined to be at the 50% 
threshold. With interstimulus intervals 
of 1 and 9 seconds, Day found con- 
sistently random serial-threshold be- 
havior. A similar set of experiments 
(Day, 1956b), with interstimulus inter- 
vals from 1.6 to 10.6 seconds, showed 
a decline in the amount of nonrandom- 
ness for intersignal intervals of longer 
than several seconds, but some subjects 
showed marked, if patchy, nonrandom- 
ness, at the longer stimulus intervals. 
Using the visual threshold (Day, 1957), 
the same author made experiments simi- 
lar to those of Howarth and Bulmer 
(1956) with interstimulus intervals of 
2, 4, 16, and 32 seconds, Nonrandom- 
ness of responses was found to be sig- 
nificantly less at the longer stimulus 
intervals. The shorter stimulus intervals 
showed patterning of both a first- and 
second-order type of redundancy, but 
there was no evidence of consistency 
from subject to subject in the way that 
the responses were organized. 

Day (1956b) speculated that fatigue, 
boredom, wandering attention, physical 
discomfort, and attitudes toward the 
experiment introduced a certain amount 
of organization into the response series 
which may have accounted for the ap- 
parent periodicity of nonrandomness 10 
his experiments. It is notable also that 
such a view had been implied by Wel- 
ford (1953). pate. 

The most intensive investigations 
into the phenomenon under discussion 
were made by Verplanck and his cole 
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leagues who undertook an extensive ex- 
amination of the response mechanisms 
at the visual threshold. Verplanck, Col- 
lier, and Cotton (1952) examined sub- 
jects’ time-ordered responses to a weak 
flash of light previously determined to 
be at the 50% visual threshold. Two 
conditions were used in which stimuli 
were paced and self-paced. Autocorre- 
lations were calculated and it was re- 
ported that even with a lag of nine 
responses there was some degree of asso- 
ciation. The correlation, as in Howarth 
and Bulmer’s (1956) work, was a 


monotonically decreasing function of 
the lag. 


It was possible that determining the 
visual threshold by the ascending and 
descending method of limits could pro- 
duce a “set” or “expectancy” for long 
Tuns of the same response, an effect 
which might have been particularly 
Prevalent in well-trained subjects. In 
order to examine this, Verplanck, Cot- 
ton, and Collier (1953) presented flashes 
of light of various i 
order. This Procedure would have 
eliminated any pr 
had for respondin, 


duce different 
which experienc 
ordered procedu 
serial dependen f long 
runs of the same response. As the ex. 
periment proceeded, however, the thresh- 
old was raised in that there was a 
decreased frequency of Seeing the flash 
of light. This was later confirmed by 
Bakan (1955) who reported that as a 
visual vigilance task Proceeded the 
threshold rose. Verplanck, Cotton, and 
Collier (1953) concluded that although 
a rise in the threshold might produce 
nonrandom series of responses there was 
evidence that response-to-response de- 
pendencies could account for it at least 
in part. Brackmann and Collier (1958) 


found that the visual threshold deter- 
mined from an ascending series of 
stimuli was lower than that determined 
from a descending series; the difference 
between these was an increasing func- 
tion of the size of the step interval, the 
variability of the psychophysical func- 
tion being directly related to the size of 
this interval. This was interpreted as a 
product of an interaction of two tenden- 
cies: first, to alternate responses which 
was a function of the length of the pre- 
ceding sequence of responses of the same 
category; second, a tendency to repeat 
responses which was a decreasing func- 
tion of the size of the step interval and 
of the preceding response. 

Examining response chains at the 
visual threshold, Collier and Verplanck 
(1958) showed that if stimuli of high 
(suprathreshold) intensity or low (sub- 
threshold) intensity were interpolated 
into a run of stimuli of invariant bright- 
ness, the sequential characteristics of 
the responses could be disturbed. Ob- 
viously the suprathreshold stimuli 
should force a response “seen” and the 
subthreshold ones the response “not 
seen.” It was shown that the magnitude 
of interresponse association following 
these interpolated stimuli was less than 
that between Tesponses taken directly 
from the invariant series, This finding 
was of considerable importance since it 
indicated that actual stimulus differences 
Were likely to be ignored in favor of 
what might be called a preoccupation 
with responses rather than stimuli. 

Further evidence of this tendency was 
found by Verplanck and Blough (1958). 
They reported that responses were not 
Statistically independent, Tuns of the 


fluctuating “sensitivity 
Possible contributing c 
Collier (1 


” factor was a 
ourse, 
954) found no reduction in 


= 
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the amount of association between suc- 
cessive responses in binocular as com- 
pared with monocular conditions and 
concluded that the nonrandomness was 
at least, in part, of nonperipheral origin 
adding force to Irwin and Preston’s 
(1937) conclusions. 

In discussions of the possible mecha- 
nisms governing response behavior, 
Brackmann and Collier (1958) and Col- 
lier and Verplanck (1958) noted that 
the tendency to alternate was a feature 
of experiments which required the sub- 
ject to guess; this was typified as a sim- 
ple response-to-response dependency. 
The tendency to repeat responses was 
said to be more complex in that it was 
a response-outcome dependence and a 
function of two variables rather than 
simply one. The situation was probably 
not as simple as this, however, McGill 
(1954) demonstrated that 28% of the 
response information was due to an as- 
sociation between the stimulus and the 
response and the preceding response, a 
half of this (14%) being a result of 
interaction between the stimulus and the 
preceding response. This indicated that 
a response-previous-response depend- 
ency existed in forced-choice tasks, but 
as Collier and Verplanck (1958) noted, 
stimulus similarity was an important 
factor. The more difficult it was to dis- 
criminate between several stimuli, the 
greater was the likelihood of there being 
response-to-response dependency. Cross- 
man (1955) pointed out the need for 
an objective measure of discriminability 
and, from a communication theory back- 
ground, developed a “confusion func- 
tion” which gave such a measure. Ver- 
planck and his co-workers have not used 
this measure although it has been avail- 
able for some time. 

Considerations of response mecha- 
nisms at the visual threshold only, led 
Collier and Verplanck (1958) to sug- 
gest four empirical constraints on any 
theoretical model of response behavior. 


They said that the magnitude of asso- 
ciation was related to the similarity be- 
tween successive stimuli, was related to 
the general level of stimulus intensities, 
was a decreasing function of the inter- 
trial interval, and may be related to the 
number of categories of judgment. 

This was particularly a criticism of 
the model originated by Weiss, Coleman, 
and Green (1955) which appeared to 
oversimplify the elaborate system of 
dependencies uncovered by Verplanck 
and his co-workers. 


Response Preferences in “Gambling” 
Situations 


Bakan (1960) reported that 80% of 
his subjects called “heads” at the first 
toss of a coin. Goodfellow (1940) gave 
a similar figure, also showing that this 
preference had fallen to 57% on the 
second toss and 44% on the third. 
There is a considerable body of research 
in existence on this topic, in particular 
that of Cohen (1960) and other mem- 
bers of the psychology department at 
Manchester University, England. Only 
a small sample of the more relevant 
papers will be discussed here. 

Gambling behavior has been studied 
by Edwards (1953, 1954, 1956) who 
found that there were probability pref- 
erences. These preferences were shown 
mostly when the odds were equal. A 
further finding of relevance was that 
gamblers tended to stay with a winning 
strategy but when losing were more 
ready to change. 

No relationship has been reported be- 
tween intelligence and subjective esti- 
mates of probability, but Dale (1958) 
added the qualification that intelligence 
determined not the hunches the man 
had, but whether he chose to be guided 
by them. Whatever the intelligence of 
the gambler some patterning in the 
ting behavior was apparent. Dale a ) 
further demonstrated the tendency S 
subjects to overestimate the value © 
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events with a low probability and a high 
gain, and to underestimate the value of 
events with a high probability anda low 
gain, in an experiment with “long odds 
and “short odds.” A relationship was 
found between subjective probabilities 
and subsequent behavior. This was not 
simple and there was reason to believe 
that some of the subjects were relatively 
sophisticated and were likely to make a 
bet and then change their opinions about 
the wisdom of their choice. 
In a situation designed to look like 
a gambling task (selecting draws on a 
pseudo-Pakistani football coupon), it 
was shown that the selection of items 
demonstrated a kind of sequential de- 
pendence (Dale, 1958) due to “bunch- 
ing” and “spacing” effects which were 
too regular to have occurred by chance. 
It appeared that selections were made 
in accordance with some nonrandom 
system. 
Although gambling behavior is very 
modifiable, it seems to be strongly in- 
fluenced by the ideas that gamblers hold 


about the probabilities of the events they 
deal with. 


Probability-Learning Experiments 


In a gambling situation subjects are 
required to place bets on certain op- 
tions after considering the consequences 
of the outcomes of these options, along 
with the potential rewards or losses. 
Much the same can be done in proba- 
bility-learning experiments, but often 
the rewards are not financial; the sub- 
ject being given merely the knowledge 
that his choice was correct or otherwise. 

There are basically two types of 
probability-learning experiments. In 
both, series of events are presented ac- 
cording to some stochastic process with 

a variety of available protocols. In the 
first type of experiment, the subjects 
are presented with a run of events and 
are then asked to estimate with what 


frequency each event occurred. In the 
second type of situation, subjects are 
required to predict which of the events 
will next occur. 

Hornseth (quoted by Hake, 1955) 
used an experimental arrangement of the 
former type. Subjects observed a se- 
quence of two lights flashing: on some 
proportion of the sequence both lights 
were illuminated; on the remainder, only 
one—the proportions were varied from 
condition to condition. Plotting the rela- 
tive frequency of occurrences as esti- 
mated by the subjects against the objec- 
tive frequency of occurrences, a typical 
phi-gamma function was found attribut- 
able to an overestimation of the high and 
an underestimation of the low fre- 
quencies. 

Similar inaccuracies were found by 
Attneave (1953) when subjects esti- 
mated the frequency with which letters 
occurred in English. This was taken as 
evidence that even long experience, of 
a stochastic process, need not effect the 
subjective estimates of relative fre- 
quency of occurrence of the events. It 
is important to note that, in Attneave’s 
situation there were 26 events, in Horn- 
seth’s only two; nevertheless, there is 
some indication that Collier and Ver- 
planck (1958) were right in placing a 
fourth empirical constraint on any sto- 
chastic model of choice behavior. 

Using the second type of experimen- 
tal procedure with a variety of unbal- 
anced schedules with two events, Grant, 
Hake, and Hornseth (1951) found that 
whatever the structure of the event 
series, subjects initially predicted each 
event with equal frequency; in other 
words, they seemed to hold an equiprob- 
able hypothesis. This appeared in many 


experiments of this type (Grant, 1953; 
Jarvik, 1951; 


: Lincoln & Alexander, 
1955; Nicks, 1959; Senders & Sowards, 
1952 Tune, 1963). 


In experiments 
where the series of eyen 


ts departed from 
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the equiprobable state (Grant, Hake, & 
Hornseth, 1951), the response series gen- 
erated by the subjects gradually aban- 
doned the equiprobable hypothesis and 
approximated the statistical structure of 
the event series. . 

Jarvik (1951) presented subjects with 
lists of two words arranged in random 
order asking them to guess which word 
would next occur and then informing 
them which word did actually occur. 
One word held preponderance over the 
other in a variety of ratios (60%, 67%, 
and 75%). Typical probability-learn- 
ing curves were produced, that is, in 
the 75% condition, the asymptotes were 
reached more quickly than the asymp- 
totes in the 67% condition and these 
again more rapidly than in the 60% 
condition. It was noted that once the 
asymptotes were reached, the curves 
tended to decline somewhat. Brunswik 
and Herma (1951) described similar 
curves as having “a slow initial rise fol- 
lowed by a slow ‘paradoxical’ decline 
without reaching statistical significance.” 
The sequences of responses showed a 
positive recency effect; that is, long 
runs of the same response. 

This type of study was enlarged upon 
by Goodnow and Postman (1955). It 
was found that in a 100:0 (two-choice) 
condition the asymptotes were approxi- 
mated more quickly than in any other 
and a decline in performance of the type 
mentioned above was thereafter evident. 
Goodnow and Pettigrew (1955, 1956) 
and Goodnow, Rubinstein, and Lubin 
(1960) made further studies along the 
same lines but with the difference that 
events were first presented with one fre- 
quency balance in a specific pattern, 
were then changed to a random fre- 
quency distribution, and subsequently to 
the initial one or yet another. Subjects 
were thus required to learn an initial- 
response strategy, were then forced to 
abandon this because events were only 
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randomly predictable, and then had to 
assume the initial or another response 
strategy. 

The implication of the results of these 
experiments was that any disposition 
to look for sequences of events was an 
advantage over having no disposition. 
Goodnow and Pettigrew (1955) showed 
similar results on the same kind of ex- 
perimental design with highly redundant 
event sequences. 

From communication theory it can be 
shown that in a two-choice situation 
making events equally probable maxi- 
mizes average amount of information. 
The maximum can be reduced either by 
unbalancing the probabilities of occur- 
rence or introducing sequential depend- 
encies into the event series or both. 
This is what Goodnow, Rubinstein, and 
Lubin (1960) did. In agreement with 
previous findings, they reported that 
subjects who had experienced structured 
runs in the first set of events most like 
the run structures in the second set of 
events, showed the greatest speed in 
adapting from one to another. These 
authors also pointed out the need for 
an independent measure of discrimi- 
nability, which is much the same as that 
which Grant (1953) was looking for, 
and drew the conclusion that an impor- 
tant factor in the process of discriminat- 
ing sequences of events was the subjects 
interest in his own responses. This is 
the response-to-response dependency 
found by other authors and indicated 
that even when event series were pre- 
sented at suprathreshold values, subjects 
tended to be influenced by responses 
which they made earlier ina response 
series. This was perhaps inevitable be- 
cause it might be reasonably assumed 
that subjects were testing out & variety 


of response strategies, which required a 


certain level of disproof before they 


were discarded. 


Edwards (1956) specified a way in 
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which different types of response strat- 
egies, Or “hypotheses,” might operate. 
2 

He suggested that: 

ig hypotheses define classes of small hy- 
po mee Small hypotheses, which usually 
Epecify sequences of choices, cannot be cor- 
rect, and so are constantly being tried, aban- 
doned, and replaced with other small hypothe- 
ses [p. 187]. 


A big hypothesis could be that the events 
have a general frequency distribution. 
A small hypothesis, for example, could 
be that certain transitional probabili- 
ties applied within the context of the big 
hypothesis, and since there are very 
many of the former, it is likely that sub- 
jects would test a good many before 
abandoning one big hypothesis in favor 
of another. 

In a similar manner Nicks (1959) 
speculated about the way in which sub- 
jects approached _ probability-learning 
situations. He noted that subjects ini- 
tially appeared to hold the equiprobable 
hypothesis which was subsequently mod- 
ified, if it was proved to be inappropri- 
ate. Nicks (1959) also made the point 
that his subjects seemed to bring along 
to the situation certain a priori expecta- 
tions concerning the structure of the 
Sequence. These expectations were like 
the small hypotheses mentioned by 
Edwards (1956) and indicated that 
while holding a very general equiproba- 
ble hypothesis, subjects seemed to have 
preferences for patterns of events possi- 
bly reflecting their ideas of “chance.” 

Further evidence that patterning of 
events was of interest to subjects in 
probability-learning tasks was produced 

by Komorita (1958). He found that 
equivalent methods of collecting data 
yielded equivalent results in that the 
learning curves for equivalent methods 
were not significantly different. An 
exception to this generalization was the 
finding that subjects did not seem to 
respond to single signal occurrences as 
they did to patterns of stimulus occur- 
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rences even though the probabilities of 
the stimulus events were identical. This 
finding indicated that the shapes of the 
learning curves using Komorita’s tech- 
nique were not the same as the curves 
obtained by Humphreys (1939) or 
Jarvik (1951). Although the asymptotes 
were the same, the rates of learning 
differed, implying that patterned events 
produced a different rate of learning 
from that of learning single events. 

Although there have been some exper- 
iments using multichoice tasks (Detam- 
bel, 1955; Gardner, 1957; Neimark, 
1956), the majority of theories of deci- 
sion making have been derived from a 
consideration of two-choice tasks (An- 
derson, 1955; Bush & Mosteller, 1955; 
Detambel, 1955; Estes, 1950, 1955). 
Some authors have claimed a high degree 
of correspondence between the theories 
they propose and the empirical findings; 
but none of these, so far, have taken 
into account all the constraints proposed 
by Collier and Verplanck (1958). 


Influence of Instructions on Performance 
in Probability-Learning Tasks 


“Recency effects” (Jarvik, 1951) are 
patternings of responses or sequential 
dependencies which exhibit tendencies 
to alternate from category to category 
(negative recency, effects, which are 
100% redundant at the second order) 
and a tendency to repeat responses 
(positive recency effects, which are 
100% redundant at the first order). It 
has been Suggested, however, that these 
recency effects are an artifact of instruc- 
tions given to subjects. 

Goodnow (1955 ) hypothesized that a 
task said to involve chance would pro- 
duce recency effects but a similar two- 
choice task said to exclude chance would 
not. Two tasks were used: a gambling 
situation in which it was implied that 
chance was involved and a problem-solv- 
ing task in which it was im 


ý plied that no 
chance was involved. Negative recency 
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effects were found in the gambling task 
where, it iš of interest to note, some 
subjects consistently chose the event 
which was most frequently rewarded as 
predicted by Flood’s (1954) theory. 
This response pattern has been found 
rarely, the reason being (Hake, 1955), 
that subjects had a primary interest in 
predicting the infrequent event and this 
interest precluded the uniform predic- 
tion of the most frequent event. Fur- 
thermore, subjects were indifferent to 
eventual outcomes because of the sim- 
plicity of the probability-learning sit- 
uation, and “reactive inhibition” may 
have reduced the frequency with which 
subjects responded to the most frequent 
event. 

Van Der Meer (1960) argued that 
instructions suggesting that the sequence 
of events to be predicted was nonsta- 
tionary would produce a higher asymp- 
totic value of the predictions of the more 
frequent event than instructi¢ns that the 
tationary. He tested his 


process was sS 
argument in a two-choice situation and 


found that it was substantially correct. 
The author showed the importance of 
this finding for mathematical learning 
theories of the Bush-Mosteller and 
Estes-Burke type, which were based on 
the ‘assumption that positive recency 


ld predominate. This tend- 


effects wou n 
ency has been widely reported through- 
ture (Ho- 


out the experimental litera 
warth & Bulmer, 1956; Senders, 1953; 
Senders & Sowards, 1952), but there 1s 
considerable evidence that negative 
recency effects occur just as often 
(Brown & Overall, 1959; Fernberger, 
1930; Grant et al, 1951; Jarvik, 
1951). Van Der Meer (1960) proposed 
that the differences in asymptotic values 
found between the two conditions of 
instructions were mainly determined by 
differences in motivation which made the 
utility of a correct prediction greater in 
the case of instructions implying that the 
process was stationary. 


Similar results were found by Hyman 
and Jenkin (1956). In a two-choice 
experiment it was reported that subjects 
led to believe that events were entirely 
predictable tended to alternate an incor- 
rect response more than subjects told 
that the event series was random. 

The experiment by Senders and 
Sowards (1952) mentioned above pro- 
duced perhaps the most startling results 
showing the influence of instructions on 
choice behavior. If subjects were 
instructed that events would occur 
approximately 75% of the time together 
and 25% of the time separately, their 
responses showed this inequality. Ac- 
tual stimulus information was ignored 
in spite of the fact that both the flash of 
light and the tone were presented at 
suprathreshold levels. 

It appears from this and other evi- 
dence quoted that instructions are of 
great importance in choice situations 
because they can bias the results in a 
number of ways. Brown and Overall 
(1959): sounded a warning, that care 
should be taken in applying learning 
theories to experimental data especially 
where there was any likelihood of re- 
cency effects of a type contrary to those 
assumed. Similar warnings have been 
made by or implied by Overall and 
Brown (1957), Edwards (1961), Feld- 
man (1959), and Lindman and Edwards 
(1961). The paper by the latter workers 
reported the interesting finding that 
both positive and negative recency 
effects could be located in the same set 
of responses in a probability-learning 
situation, depending essentially on how 
long the task lasts. 


Grouping of Responses 
Findings 
There is considerable experimental 


evidence to support the claim that 
human reaction time to an event is a 
function of the information content of 
the event (Brown, 1960; Crossman, 


and Various 
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1953; Hick, 1952; Hyman, 1954), It 
is also known that the average informa- 
tion content of an event is reduced by 
making the series in which it is con- 
tained redundant in some way. Although 
“Hick’s law (Welford, 1961) has 
recently been called into doubt (Alluisi, 
Muller, & Fitts, 1957; Leonard, 1954; 
Mowbray, 1961; Mowbray & Rhoades, 
1959), the final status of the law has not 
yet been determined. 

In a probability-learning situation 
Hake and Hyman (1953) found that 
responses, examined as a function of 
preceding responses, showed that sub- 
jects responded to sequences of events 
including preceding predictions. Homo- 
geneous runs of correctly predicted sym- 
bols were more “adequate” than others, 
This type of work was continued by 
Senders and Cohen (1955) who intro- 
duced sequential dependencies into the 
settings of pointers on dials on succes- 
sive readings, They found that perform- 
ance improved when measured in terms 
of information transmitted or accurancy. 
Redundancy can be regarded as a safe- 
guard against error and misunderstand- 
ing in communication (Attneave, 1959), 
hence Senders and Cohen’s results are 
not entirely unexpected. It has been 
well established that language is highly 
redundant (Newman & Gerstman, 1952) 
and in situations where accurate com- 
munication is of importance as in air 
traffic control, it is not surprising that 
the messages passed are more redundant 
than the language in general (Fritz & 
Grier, 1955), 

Learning a language can be regarded 
as learning a complex system of events 
with their probabilities of occurrence, 
Bennett, Fitts, and Noble (1954) car- 

ried out a simplified version of this proc- 
ess. They used equally probable events 
which were generated according to a re- 
stricted random procedure, such that re- 


dundancies at various orders of estima- 
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tion were present. Since there were five 
events, the number of possible combina- 
tions was very large indeed. In spite of 
this, subjects learned digram sequences 
to a significant level, an additional find- 
ing being that the responses were more 
redundant than the stimuli because of 
pattern preferences. A second experi- 
ment demonstrated that subjects could 
learn both the preferred or nonpreferred 
sequences although initially the nonpre- 
ferred sequences presented diffculty. 
This might be interpreted to mean that 
initial preferences can be rapidly modi- 
fied. Trigram sequences were not 
learned in general, probably because 
there were 50 trigram patterns which 
would require subjects to retain a con- 
siderable amount of sequential material. 

It was suggested that learning engram, 
digram, and trigram patterns was 
inversely related to the total number of 
possible events. This would, so Bennett, 
Fitts, and Noble (1954) claimed, ex- 
plain the failure of their subjects to 
show significant learning of the trigram 
patterns. A contributing cause might 
have been that the interstimulus inter- 
val of 4 seconds was so long that it was 
difficult for subjects to retain much of 
the information in immediate memory 
and it would also have militated against 
the tendency to “group” sequences 
together., 

Grouping of responses and recognizing 
patterns was studied by Brown (1960). 
He wanted to know the most efficient 
way of grouping from 3 to 15 signals. 
The 3-choice, 7-choice and 15-choice 
situations were extracted from displays 
of lights which were presented singly or 
in combinations; combining sources 
allowed fewer lights to be used. At the 
3 and 7-choice levels there was no signifi- 
cant difference between the rate of gain 
of information from the two types of 
display. At the 15-choice level a sig- 
nificantly greater amount of information 
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gain was reported from the single-source 
display, and reaction time against infor- 
mation in the display showed a linear 
relationship. 

The importance of human ability to 
perceive and respond to patterns of stim- 
uli is of interest mainly to research 
workers who speculate about the proc- 
esses of information transmission, but 
it has implications for both the organi- 
zation of behavior and more specifically 
for the phenomenon of anticipation in 
skilled performance. The literature 
reviewed does not allow definite conclu- 
sions to be made in all cases. Often the 
evidence presented from different 
sources appears contradictory, but where 
possible this has been explained as a 
variation in interpretation or a result 
of the statistical techniques used. In 
many instances there are gaps remain- 
ing in the experimental evidence which 
are a hindrance to any attempted gen- 
eralizations, and it is here that extrap- 
olation must be used but with caution. 


Some Explanations of Response Pat- 
terning 


Much work has been done on animals 
in the familiar Y or T maze, where the 
sequentially ordered list of choices of 
pathway exhibited the effects under dis- 
cussion (Dennis, 1939; Dennis & Hen- 
neman, 1939; Heathers, 1940; Hunter, 
1914; MacGillivray & Stone, 1930; 
Tolman, 1925; Yoshioka, 1929). The 
behavior was often described as an “ex- 
ploratory tendency” (Tolman, 1925) 
and there were many attempts to explain 
this tendency in terms of work decre- 
ment or fatigue (see Solomon, 1948, for 
a review of the literature). A further 
explanation was in terms of reactive 
inhibition (Zeaman & House, 1951). 

Skinner’s (1942) explanation of the 
contrast effect in terms of a subjective 
concept of “chance” allowed the possi- 
bility that such tendencies are innate. 


297 


At that time alternation tendencies 
predominated in experimental findings 
in both human and animal performance; 
since then, the positive recency effect 
has been found, along with demonstra- 
tions that such effects might be artifacts. 
The general explanation Skinner offered 
did not deny that both positive and 
negative recency effects were compatible 
with such a concept. Obviously chance 
could mean response sequences which 
showed alternations to some subjects, 
and to others, repetitions of the same 
response. 

Solomon (1949) claimed that reac- 
tive inhibition could not explain the 
tendency to alternate. The main point of 
the argument was that reactive inhibi- 
tion would dissipate in time and thus 
long interstimulus intervals would re- 
duce the tendency. Glanzer (1953) 
offered a similar theory in terms of stim- 
ulus satiation. Bendig (1951) thought 
that Hull’s reactive inhibition postulate 
implied something other than that inter- 
preted by Solomon (1949). It predicted, 
for example, that rewarding or rein- 
forcing a response, such as a guess, 
would have the effect of increasing the 
probability of a repetition of that re- 
sponse on the next elicitation, and that 
the effect would be greater the larger the 
number of rewarded responses. Bendig 
(1951) found this to be so and preferred 
not to abandon the reactive inhibition 
theory. 

The production of nonrandom sequen- 
ces of responses, by whatever means, led 
Senders and Sowards (1952) to specu- 
late that since psychophysical methods 
assume the independence of successive 
responses, a good deal of the work done 
using psychophysical methods must be 
called into doubt, Conklin (1954) 
pointed out that in Senders’ (1953) 
experiment, subjects could only guess. 
Ina true psychophysical situation, there 
are supraliminal as well as subliminal 
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and threshold events; thus a judgmental 
set as opposed to a guessing game 
develops, especially where there are 
three categories of response. Conklin 
found that if real differences in stimulus 
values were used, then subjects could 
respond to them correctly. This implied 
that the type of experimental setup 
could suggest a method of response to 
a subject, or at least create a set to re- 
spond in a certain way, 

Weiss, Coleman, and Green (1955) 
developed a mathematical model from 
the data in which subjects were required 
to bisect a perceived angular extent on 
the basis of kinesthetic cues. Subjects 
tended initially to drift into constant 
error, and long series of bisections 
yielded nonrandom response sequences 
as a result of Positive-recency effect. A 
stochastic model for this type of response 
behavior was described as “a Markoff 
(first-order 


The implication of this was that in order 
to predict wi 


of “kinesthetic trace,” 
which was small, 


dent. In visual-t 
however, response sequences show much 
larger-range sequentia] dependencies, 
(Howarth & Bulmer, 1956). An inter- 
pretation of these two findings may well 
be in terms of “kinesthetic” and “yig. 
ual” traces, the latter having greater 
duration. 

The evidence suggests that, in general, 
as the distance between two responses 
in a response chain is increased, the 


correlation diminishes. If decision proc- 
esses are mediated by a central mecha- 
nism, it is hard to account for the lessen- 
ing of sequential dependence in a 
kinesthetic judgment task as opposed to 
a task which involves visual, auditory, 
or tactual judgments. Apart from the 
tentative explanation offered above, how- 
ever, a further point can be made. The 
extent of sequential dependencies in a 
response series may reflect the degree of 
uncertainty a subject experiences in 
making responses. If stimulus deter- 
mination is impossible, then a guessing 
game set must be assumed and the 
sequential effects are entirely the result 
of response-previous-response or pre- 
previous-response influences. At the 
other extreme, where stimulus determi- 
nation is entirely possible, then any 
sequential effects must be a reflection 
of sequential dependencies in the stimu- 
lus events. At points in between these 
extremes there is neither complete stim- 
ulus determination nor complete re- 
Sponse determination, In this region 
responses may be partly determined by 
events in both the stimulus chain and 
in the response chain, a situation which 
exists in threshold experiments, Unfor- 
tunately most research has examined 
response sequences at the threshold in 
terms of sequential dependencies in the 
response chain only. McGill (1957) has 
demonstrated that both stimuli and 
Previous responses can determine re- 
sponse behavior, 

Since responses at threshold usually 
be made, it 
inations are 


ssary for an 
The magnitudes 


e. It may be that 
ents required of 
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the subjects by Weiss, Coleman, and 
Green (1955) provided a dependency of 
the Markov type because the judgments 
were relatively easy to make; that is, 
there was sufficient information in the 
stimulus series to minimize the effect of 
response determination. In situations 
where discriminations or decisions are 
much harder to make, then more com- 
plex sequential dependencies are to be 
expected in the response chain. 

It is easy to criticize mathematical 
models of the stochastic kind because 
they tend to oversimplify the situation. 
Howarth and Bulmer (1956), however, 
have indicated, and Bulmer and Ho- 
warth (1957) have demonstrated, that 
models of behavior which try to take 
into account long-range sequential 
effects become very complex, as antici- 
pated by Arons and Irwin (1932). In 
their model of nonrandomness in thresh- 
old experiments (Bulmer & Howarth, 
1957), it was assumed that sequential 
effects were comparatively short-lived. 
They also suggested that: 

It is obviously good strategy (for S) to set his 
threshold low if the stimulus occurs often and 
high if it occurs seldom, since the chief dan- 
ger in the first case will be to fail to see a 
stimulus which is there, and in the second case 
to see it when it is not there. A tendency 
to repeat his previous response will achieve 
this object of raising the average threshold 
when the stimulus is rare, and lowering it when 
the stimulus occurs frequently [p. 8; italics 
added]. 

In comparison, experiments where avoid- 
ance of repetitions seemed to be the 
prevalent pattern of response, it was 
suggested as reasonable that the test 
stimulus would be greater than the 
standard stimulus on about half of the 
occasions. 

In such a case the best strategy would be to 
fix the threshold at zero and to leave it there. 
However, if there is any danger of a constant 
error resulting from the threshold being fixed 
some distance away from zero, alternation of 
responses becomes a good strategy, since it will 
have the effect of shifting the average threshold 
towards zero, provided, of course, that the 
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greater and less comparisons are presented 
equally often [p. 9]. 

If forced-choice judgments of the sort 
reported by Senders and Sowards (1952) 
can be regarded as comparisons then this 
explanation need not apply. Tune 
(1963) found alternation, but here there 
was the possibility of some objective 
evidence from which judgments could be 
made, whereas in Senders and Sowards 
(1952) and Senders (1953) there was 
not. 

The complexity of the theoretician’s 
task in building a mathematical model 
of choice behavior at the threshold, or 
otherwise, is evident. There are many 
variables in operation and there is a 
bewildering amount of experimental evi- 
dence emphasizing the importance of 
each. There are also a great many sta- 
tistical techniques in use for detecting 
the existence and evaluating the range 
and strength of sequential dependencies. 
Statistics derived from communication 
theory such as those described by New- 
man (1951) and Attneave (1959) de- 
scribe the sequential dependencies in 
sequences of events, but they cannot 
treat both stimulus and response events 
simultaneously or deal with interaction 
terms. The multivariate information 
transmission technique (McGill, 1954) 
and the symmetric uncertainty analysis 
(Garner, 1957) are perhaps the most 
powerful communication measures yet 
developed. Both are multidimensional 
—like the parametric analysis of vari- 
ance—are capable of dealing with sev- 
eral variables simultaneously, and can 
give measures of interactions between 
variables. The applications of such sta- 
tistics to the type of research discussed 
in this review are, it is felt, likely to 
yield the most important results. 
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A NOTE ON “ON SIMPLE METHODS OF SCORING 
TRACKING ERROR” 


NICHOLAS J. CARRIERO 
Systems Support Department, Bell Aerosystems Company, Buffalo, New York 


The article deals with the omission of mean and variability data in Poul- 
ton’s (1962) review. Implications of the error are discussed and the miss- 


ing data are supplied. 


Poulton (1962) has performed a most 
needed service in his review of simple 
methods of scoring tracking error, es- 
pecially for those laboratories that lack 
computer facilities for analyzing such 
data. There is, however, one oversight 
in the report. 

Data presented in Table 1, on page 
324, for errors in position at reversals, is 
broken down into right-left and over- 
shoot-undershoot components. While 
different values are given for the mean 
CE and the SE of the CE for each of 
the components, only one value is shown 
for the mean SD and the SE of the SD. 
This relationship suggests that the opera- 
tions, by which the right-left and over- 
shoot-undershoot error components are 
derived, are such as to result in a con- 
stant difference between them with the 
consequence that both components yield 
the same SD. 

A review of Poulton’s procedure for 
determining these errors indicates that 
while the same absolute values are in- 
volved in both errors, there are sign 
changes from one component to the 


other which are not necessarily consist- 
ent. Such conditions assure that the 3,? 
in the SD formula will be equal for the 
two measures but do not assure that the 
(3x)? will necessarily be equal. 

These considerations suggested that 
the previously reported data were either 
the result of a fortuitous occurrence or 
that additional data had been inadvert- 
ently omitted from the report. 

Poulton has informed me that the lat- 
ter is the case. The SDs presented for 
position error at reversals in the table 
are for the right-left component. The 
SD values for the overshoot-undershoot 
component, he reports, are: Preview 
Version—M=1.02, SE=.10; Slit Ver- 
sion—M =2.11, SE=.27. 

He also reports that the Slit mean was 
larger than the Preview mean for each 
of the 12 subjects (p<.001). 
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HYPOTH- 
ERIORATION AND FACILITATION 
RES IN SENSORY-DEPRIVATION RESEARCH 


CHARLES A. BROWNFIELD 
Mendocino State Hospital, Talmage, California 


2 opposing hypotheses have emerged from research in sensory deprivation: 
it impairs mental functioning and it facilitates mental functioning. Much 
evidence exists for either or both cases, but review of some of the litera- 
ture suggests that whatever the requirements of the human organism for 
varied external stimulation, mere reduction or increased patterning of 
input will not alone produce major disruptive or enhancing effects. The 
multiplicity of reported results make it appear unwise to conclude that 
sensory or perceptual isolation, as it is currently conceived, results in de- 
terioration or facilitation of mental function until more research is ac- 
complished. Evidence from learning studies, in which the disparate hy- 


potheses are most pronounced, stimulated the 
empirically the phenomena reported in a wid 


deprivation. 


The hypothesis that an organism re- 
quires not only stimulation but a con- 
tinually varied sensory input for the 
maintenance of normal, intelligent, 
adaptive behavior (Heron, Bexton, & 
Hebb, 1953) has been subjected to ex- 
perimental scrutiny by a number of the 
original investigators since the publica- 
tion and amplification of the initial 
Studies conducted at McGill University 
(Bexton, Heron, & Scott, 1954; Doane, 
Mahatoo, Heron, & Scott, 1959; Hebb, 
1955; Heron, 1957; Heron, Doane, & 
Scott, 1956; Scott, Bexton, Heron, & 
Doane, 1959). The findings of these 
Investigators seem to have confirmed the 
hypothesis of cognitive and perceptual 
deterioration resulting from exposure to 
a prolonged period of reduced and de- 
creased variation in the total-sensory 
environment, a condition concomitant 
with social and Perceptual isolation, 

Two studies in this context, both con- 
ducted at Princeton University (Vernon 
& Hoffman, 1956; Vernon & McGill, 
1957), have concerned themselves with 
learning in human beings as a measure of 
intellectual, cognitive and perceptual ef- 

ficiency, and the hypotheses in both of 
these were opposed to what might have 
been expected in view of the McGill 
work. These predicted that “sensory 
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Present attempt to integrate 
le range of studies in sensory 


deprivation,” a misnomer applied to the 
original isolation procedure, would facil- 
itate the learning process because of the 
absence of extraneous inhibiting stimuli 
interfering with the acquisition of new 
material. The first Princeton study 
(Vernon & Hoffman, 1956), using a 48- 
hour isolation period and four subjects, 
revealed a significantly better perform- 
ance for the experimental group, a find- 
ing consistent with the facilitation hy- 
pothesis, in the expected direction, But 
the second Princeton study (Vernon & 
McGill, 1957), utilizing a 72-hour period 
and a larger N of 10, not only failed to 
confirm the findings of the first experi- 
ment at the end of the longer period, 
but also failed to demonstrate a signifi- 
cantly better performance for the iso- 
lates after 48 hours, 

A third and more recent study at the 
University of Miami (Arnhoff, Leon, & 
Brownfield, 1962), utilizing the same 
learning task as the two Princeton in- 
vestigations but replicating the sensory 
and perceptual conditions of the McGill 
studies, failed to find significant facilita- 
tion or decrement in the performance of 
an experimental group with an N of 12. 
Neither deterioration nor facilitation 
hypotheses were Supported and the in- 
vestigators concluded that whatever the 
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requirements of the adult human organ- 
ism for external and varied stimulation, 
reduction or patterning of input will not 
alone produce major disruptive psycho- 
logical effects. Such results are the 
product of a complex interaction of per- 
sonality, anxiety, expectation, situational 
structuring, and amount and patterning 
of external sensory input. 

The phenomenon of facilitation by re- 
duction of variable sensory input has 
thus been shown to be untenable under 
the conditions imposed by the experi- 
mental procedures, although some quali- 
tative differences in subjects’ perform- 
ance were still observed. In general, 
there was less “disagreement” with the 
McGill findings in the Miami and second 
Princeton studies than in the first 
Princeton study, though agreement does 
not quite describe the results either. 

A possible explanation of the observed 
discrepancies does not appear so much 
a matter of depth or duration of sensory 
deprivation as it is of rejecting any sug- 
gested facilitation or deterioration of 
learning simply by reducing varied sen- 
sory or perceptual stimulation. The first 
Princeton study did not attempt to 
replicate the original McGill conditions, 
nor was the second essentially the same 
as the first. Had a replication been ap- 
proximated at the beginning, the results 
of the first learning study might have 
been more congruent with those of the 
second, and both could have been inter- 
preted as a refutation of either the 
deterioration hypothesis or the hy- 
pothesis of facilitation, as was the case 
in the Miami study. This outcome 1S 
strongly suggested by examination of the 
experimental conditions of the two 
Princeton learning studies, as compared 
with those at McGill where, incidentally, 
learning was not the primary process 
scrutinized. The degree of sensory 
isolation was more severe in the second 
Princeton experiment and more closely 
approximated that of the McGill cubicle; 
the differences, however, may be crucial 


ones for understanding the divergent 
results. 

In the second Princeton study, the 
duration of time equaled that of 
McGill’s. The experimental cubicle con- 
tained only a bed, but the first design 
used both a bed and a chair. The sub- 
jects in the first study had free access 
to both objects; therefore, the degree of 
movement restriction was liberal when 
compared with the second study and the 
McGill experiment where subjects 
were restricted to lying on the bed only. 

A basic, and perhaps critical, dif- 
ference was that in the two independent 
series of studies, one group of subjects 
spent the major portion of their time in 
total darkness, while the others always 
had a moderate amount of light-stimula- 
tion to the retina. In the second Prince- 
ton study, adjective lists were presented 
by tape recorder in the cubicle ante- 
chamber while the subjects sat blind- 
folded on the edge of the bed; previously, 
they had been let out of their cubicle for 
meals, toileting, and testing in the ante- 
chamber by the light of a 15-watt red 
bulb. By contrast, there was always 
diffuse illumination from a shielded 40- 
watt white bulb in the McGill study, but 
translucent goggles prevented patterned 
vision. 

The McGill cubicle was only sound- 
deadened, whereas the Princeton cubicle 
was soundproofed. There was constant 
noise from the ventilating equipment in 
the first situation, but great pains were 
taken to produce a considerable sound- 
loss (80 decibels) in the latter; here 
again is a condition which differed in 
such a way as to lead to some specula- 
tion about the necessity of having just 
minimal stimulation to insure organize! 
thinking. 


OTHER RESEARCH SUGGESTIONS 


ided some 
Doane et al. (1959) provides 
important clues for understanding the 


phenomenon of the occurrence of n 
lucinations which may, for the purp° 
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of the present discussion, be regarded as 
a rough indicator of the severity see 
organized thought processes, as well as 
the efficacy of the isolation procedure. 
They had some subjects wear both 
opaque and translucent masks; two 
cubicle subjects wore the opaque masks 
until an hour before the end of the ex- 
perimental period, when they were then 
fitted with the translucent mask worn 
by the others. Of the 11 subjects who 
wore the translucent mask continuously, 
8 developed hallucinations. When the 
two subjects, only one of whom had 
reported having weak hallucinations 
under the opaque condition, were given 
translucent masks, both had immediate 
and vividly strong hallucinations. Five 
subjects, who were among the most 
persistent hallucinators with translucent 
masks, were also put into complete dark- 
ness. They all reported an immediate 
increase in vividness of hallucinations, 
but within 2 hours there was a reported 
decrease, three having no more hallucina- 
tions and two having them greatly de- 
creased. When they were re-exposed to 
diffuse light, all of them reported that 
hallucinatory activity returned to the 
original level of intensity. It was further 
reported that two of four ambulatory 
subjects, who were not confined to the 
cubicle, but who did wear translucent 
masks, had hallucinations, 
The McGill experimenters suggested 
not only that diffuse light was only one 
factor in the hallucinatory phenomena, 
but that a disturbance in function may 
be greatest in the area in which restric- 
tion occurs, and there might also be 
some spread of effect to other senses, 
that is, auditory hallucinations may re- 
sult from reduction of visual stimulation, 
etc. In other words, there may be some 
interaction between sense modalities 
which depend upon external stimula- 
tion. If there is any validity in their 
observations, then under conditions of 
almost no stimulation, there should be 
less disorganization, as gauged in this 
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case by the occurrence and strength of 
hallucinations, than under conditions of 
invariant diffuse stimulation. Such an 
inference is not inconsistent with the 
outcome of the Princeton learning studies 
discussed here, since there was no 
reliably significant difference between 
the groups in either situation at the con- 
clusion of the experiment. Such was 
also the case in the Miami study in 
which, under the monotonous, nearly 
static conditions approximating the 
McGill study, no instance of hallucina- 
tions, delusions, or other unusual phe- 
nomena was reported. 

The results of the McGill studies and 
the two Princeton investigations raise 
some question about that part of the 
deterioration hypothesis which suggests 
only that some stimulation is necessary 
for the maintenance of normal, intelli- 
gent, adaptive behavior, since lack of 
stimulation in one case resulted in the 
reduction or disappearance of hallucina- 
tions, and in the other no significant 
differences between experimental and 
control subjects could be consistently 
demonstrated. In the sense that no 
stimulation elicits no response, the 
McGill hypothesis is correct; this is not 
intelligent, adaptive behavior, or more 
precisely, it is not behavior at all. But 
if no behavior is manifested, then no 
inferences of disorganization are war- 
ranted, since the organism becomes 
almost “inert” when nothing happens in 
the environment for a long enough 
period. It seems, rather, that a minimal, 
diffuse, monotonous stimulation is one 
of the primary requisites for disorganiza- 
tion of cognitive and perceptual func- 
tion; lack of such stimulation by the 
Severest deprivation of input possible 
may produce nothing but a lack of re- 
activity, or a kind of 
“suspended animation.” 
fact that monotony may 
but not a sufficient con 
organization, 
Miami group, 


psychological 
However, the 
be a necessary, 
dition for dis- 
is demonstrated by the 
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What seems really to be necessary for 
the maintenance of integrated behavior 
is varied sensory input. When hallucina- 
tions reportedly occur, or at least when 
they are of maximum vividness, some 
visible input is always present from one 
or more sensory sources. Hallucinatory 
activity may only be an attempt by the 
particular individual to differentiate and 
to structure the input so that it has 
content and meaningfulness, even if the 
resulting organization is bizarre because 
of reliance on internal frames of reference 
in the absence of external ones. 

Silverman, Cohen, Bressler, and 
Shmavonian (1958) described a situa- 
tion during a sensory-isolation experi- 
ment of short duration in which the sub- 
ject, a female psychology student, was 
in a totally dark cubicle. Inadvertently, 
the black paper covering the observation 
window was moved so that a crack of 
light became momentarily visible to the 
girl. She immediately reported that she 
had perceived what she thought was a 
Rorschach card and began elaborating 
an intricate delusionary system around 
this accidental, fleeting stimulus. It was 
pointed out that this normally negligible 
incident was sufficient to produce a com- 
plex organization of associations about 
a topic with which the subject was “ego 
involved.” If one isolated stimulus 
against the background of monotony, or 
no stimulation, can evoke such complex 
behavior, then a continually varied sen- 
sory environment would undoubtedly 
seem necessary for the maintenance of 
such behavior. When variation is lack- 
ing, but a constant input is present, 1n 
spite of adaptation the organism “be- 
haves” a little differently than if there 1S 
relatively no stimulation whatsoever ; 
relatively little, if anything, happens. 

Studies in which severe reduction of 
sensory input is achieved by immersion 
in water heated to body temperature 
(Lilly, 1956; Lilly & Shurley, 1958) 
have demonstrated that hallucinations 
and disorganization of thinking ensue 


with such rapidity that, by the end of 2 
or 3 hours, the subject finds it difficult 
to continue. This is reminiscent of the 
increased activity reported by the McGill 
subjects who were placed in darkness 
after being in diffuse stimulation; the 
hallucinations began to disappear with 
the passage of a time period roughly 
equivalent to that of the water-tank ex- 
periments. In both cases, the rapid 
change from a condition of stimulation 
to one of relatively no stimulation re- 
sulted in increased hallucinatory activity, 
that is, disorganization of thinking. The 
change, in itself, may represent a vari- 
ability of stimulation to which the 
organism responds with greater effort 
and energy by attempting to structure 
perception; when, finally, there is noth- 
ing to perceive, the situation becomes 
threatening. The process of deafferentia- 
tion may be so overwhelming that it 
becomes a threat to one’s very experience 
of existence, a state which, in the final 
analysis, may only be determined by the 
ability to respond to stimuli. 

Other supporting evidence for change 
and variation being essential for or- 
ganized functioning is the light-deprived 
subjects’ responses upon being re-exposed 
to light (Doane et al., 1959); hallucina- 
tory activity immediately reappeared 
with the original vividness. Again, this 
may be a kind of homeostatic attempt 
to structure perception in the presence 
of stimulation, but, falling short of its 
mark because of the lack of anything to 
differentiate. Time distortions, too, have 
been reported, so that the tendency ap- 
pears, generally, to be an underestima- 
tion of the duration of isolation (Lilly, 
1956; Lilly & Shurley, 1958; Silverman 
et al., 1958), an expected result if no 
progression of temporal events 15 per- 
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ing stimulation, not with actual 
oe Bs or reduction of stimulation; 
in this sense, they were imposing a pat- 
terning of an extreme sort, No one to 
date has ever succeeded in depriving a 
human being entirely of visual, auditory, 
tactual, kinesthetic, and olfactory stim- 
ulation simultaneously (to mention only 
some of the variables which are capable 

of being externally controlled); all of 
the studies cited in the literature have 
only “monotonized” and reduced the 
intensity of stimuli or increased the un- 
certainty of stimulus situations, but have 
not eliminated them completely. Ex- 
ternal stimulation was always present in 
the McGill cubicle in the form of low 
illumination, constant noise from the 
ventilating system, and no effective pre- 
cautions against gross movement. While 
normal levels of sensory, perceptual, and 
social stimulation were severely reduced, 
it does not seem appropriate to call this 

“deprivation,” that is, no stimulation 

whatsoever. Neither does it appear to 

be a case of “reduced” patterning of 
stimuli, but more likely “increased” 
patterning. 

What the water immersion experi- 
ments (Lilly, 1956; Lilly & Shurley, 

1958) did accomplish appears to be a 

relatively greater degree of stimulus re- 

duction and decreased variation, espe- 
cially exteroceptively and Propriocep- 
tively, not an absolute reduction or 
elimination, as implied. In the respirator 
experiments (Mendelson & Foley, 1956) 
too, stimulation was not absent, but the 
variety and intensity were markedly 
reduced and made so monotonous that 
the situation was not unlike that of the 
McGill cubicle or Lilly’s water tanks; 
no actual deprivation seems to be in- 
volved, except in degree. 

In view of all the methods thus far 
reported to produce sensory, or per- 
ceptual isolation (or invariant stimula- 
tion in the deprivation sense), it would 
appear that only two basic procedures 
are available to researchers in this area. 


The first consists primarily of decreas- 
ing the variation of stimulus input toa 
near-static level so that the effect is that 
of imposing a monotonized structure on 
the environmental stimulus complex; 
this is what has been done in most of the 
experimental studies reported. The 
second, and perhaps more difficult 
method, is to deprive the subject of 
stimulation in one or more sense 
modalities so that while neurophysio- 
logical receptor systems remain intact, 
they become nonoperational because of 
the lack of stimulation. Since no stim- 
ulation elicits no response, there is reason 
to suspect that these two different proce- 
dures result in qualitatively different out- 
comes. Attempts to produce this kind of 
a situation have been Partially success- 
i clative reduction of stim- 


as been achieved without 
the complicating side effects of drug 


anesthesia or amputation. Somewhat 
More success attends deprivation of 
visual and auditory input because of the 
ease with which light and sound stim- 
ulation may be experimentally dimin- 
ished with appropriate conditions and 
equipment (Doane, 1955; Hebb, Heath, 
& Stuart, 1954; Nissen, Chow, & 
Semmes, 1927; Silverman et al., 1958), 
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during the focus of attention (Hernan- 
dez-Peon, Scherrer, & Jouvet, 1956). 
Magoun (1958) has, in fact, shown that 
conceptions of brain organization have 
been enlarged by identification of non- 
specific neural mechanisms lying between 
sensory and motor systems, and that 
reciprocal ascending and descending con- 
nections between those lying in the brain 
stem and wide areas of the hemispheres 
are involved in arousal to wakefulness 
and alerting to attention. Moruzzi and 
Magoun (1949) and French (1957) 
have identified an area in the brain-stem 
reticular formation which responds in the 
same nonspecific way to all afferent 
stimuli from different receptor sources. 
This response is simply to arouse the 
brain, not to relay any specific message; 
its signals are projected to the entire 
cortex rather than to any one sensory 
center. This reticular activating sys- 
tem, or RAS, as the investigators called 
it, is selective to specific stimuli, that is, 
it makes differential responses to sudden 
or even continual sensory input, as if it 
had the ability to discriminate among 
stimuli. If the RAS does not function 
normally, as when there is actual tissue 
damage, a state of consciousness becomes 
impossible; the intact system, however, 
can maintain a wakeful state, even 1m 
the absence of a cortex. Here, “wakeful- 
ness” is not synonomous with “conscious- 
ness,” since it is possible to be awake, 
yet conscious of nothing, as is a new- 
born infant, or even a subject in an isola- 
tion cubicle where there may be almost 
nothing to perceive or differentiate. 

In other words, while a stimulus 
evokes or guides a specific bit of be- 
havior, it also serves the nonspecific func- 
tion of maintaining a normal state of 
arousal through the RAS. Ina condition 
of stimulus improverishment, decreased 
variation, or, if possible, in more extreme 
reduction, the sensory impulses, which 
normally traverse and activate the 
reticular system, are, operationally, non- 
existent (or markedly reduced), thus 
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producing an effect akin to that which is 
achieved by a lesion of the brain stem, 
that is, deafferentiation. Since an actual 
lesion would naturally make it impos- 
sible to study the consequent effects on 
perceptual and cognitive functioning, the 
method of experimentally controlling the 
approximate input of external stimula- 
tion by the isolation procedure makes it 
possible to study the effects on intact 
subjects. Besides, as Bexton et al. 
(1954) so poignantly observed, college 
students, the usual population studied 
in this context, are understandably 
reluctant to have their brain stems cut. 
Research conducted at Duke Univer- 
sity Medical Center was concerned with 
some rather specific neurophysiological 
reactions of human beings in low-sen- 
sory-input environments and have 
yielded much valuable information about 
resting catecholamine and hormonal 
secretion rates, central nervous system 
activity, peripheral vascular activity, 
respiration, liminal discriminations, and 
other functions (Cohen, Silverman, & 
Shmavonian, 1960a, 1960b). Using 
Witkins’ Rod-and-Frame procedure and 
Machover’s Draw-a-Person Test to dis- 
criminate two distinct groups, Cohen, 
Silverman, Bressler, and Shmavonian, 
(1958) have also demonstrated signifi- 
cantly different responses from subjects 
classified as “field dependent” and “body 
oriented” (field independent) ; these dif- 
ferences appear to be related to char- 
acteristic modes of interacting with the 
external environment by reliance upon 
external or internal cues for a frame of 
reference. It was suggested that individ- 
uals who relied on field cues react dif- 
ferently than body-oriented subjects 1n 
a situation in which field cues are 
we R Field-dependent 
minimized or lacking. +1 d less 
subjects consistently displaye Ta 
ability in discriminating E em 
remained more centrally ey a m= 
to move around more, appearé pa 
ogled more with feelings 
fortable, strugs 4 re more 
and fantasies, Or denied them, we! 
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suspicious, projected internal percepts 
more frequently, and, when interview 


rouped to get a rough 
oe a a fnetion” showed a 
a degree of disorganization. 
j There was also some suggestion that 
the ipdividual differences reflected in 
body-field perceptual relationships may 
not only be related to responses in an 
environment that exaggerates these dif- 
ferences and leads to more arousal in one 
group, but the results may also reflect 
differences in neurohumoral (noradrena- 
line) and physiological variables which 
may be either a function of body-field 
differences or a parallel expression of 
some underlying central nervous system 
difference between the two groups 
(Cohen et al., 1960b). 


EARLY EXPERIENCE 


The literature abounds with reports 
of experimental investigations designed 
to study the deleterious effects of early 
environmental conditions on later be- 
havior (King, 1958). Most of this work 
has been done with animals, mainly be- 
cause of the obvious advantages involved 
in studying organisms whose rate of ma- 
turational development is relatively 
faster than that of human beings. At- 
tempts at human studies are, for the 
most part, necessarily restricted to ob- 
servational methods, case histories, and 
other types of ex post facto designs be- 
cause of the difficulties inherent in con- 
trolling conditions of early experience 
or of manipulating the environment 
(Davis, 1940; Dennis & Dennis, 1940; 
Hill & Robinson, 1929; Melzak, 1954; 
Spitz, 1954, and others). Generally, 
however, most experimental efforts with 

animals or observational methods with 
humans involve the impoverishment of 
stimulation at various developmental 
stages in order to assess the later differ- 
ential effects of such treatment. It is 
important to note the emphasis placed 
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upon the imposition of the deprivation 
conditions at a stage in the organism’s 
development when it has not fully ma- 
tured. On the other hand, sensory-de- 
privation research is concerned primarily 
with the fully matured adult’s current 
response to stimulus reduction, decreased 
variation, or absence, of input. In many 
respects, the results of early-experience 
studies and sensory-deprivation research 
are comparable, that is, the cognitive, 
perceptual, and motivational changes re- 
ported are similar, but not precisely the 
same. 

Animals exposed to varying condi- 
tions of deprivation, restriction, or iso- 
lation at different chronological ages 
may manifest entirely different behav- 
ioral responses, depending upon the par- 
ticular developmental stage at which they 
undergo the treatment (King, 1958). 
In sensory-deprivation research, the or- 
ganism’s ability, or inability, to adapt 
to the stresses of the contemporary stim- 
ulus situation, or to no stimulation at all, 
is of primary concern, and reports of the 
consequences of such stresses are very 
consistent with one another, depending, 
of course, upon the type of deprivation 
employed. Whereas early experience al- 
most always appears to result in perma- 
nent, usually deleterious, alteration of 
what would otherwise have been typical 
behavior for the particular species under 
consideration, sensory or perceptual im- 
poverishment, or invariability, seems to 
have its greatest effect upon the mature 
organism at the time it is being subjected 
to these conditions; perserveration of ef- 
fects vary with the intensity and dura- 
tion of the isolation, as well as with 
the individual subject but no permanent 
or lasting changes have ever been relia- 
bly reported. This is not to say that, 
given specific stimulus conditions in suffi- 
cient amounts for undetermined dura- 
tions and with appropriate subjects, last- 
ing deteriorative or facilitative changes 
are not found; but it may very well be 
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that, once we succeed in establishing a 
reliable body of knowledge about the 
phenomena and learn to control them, 
permanent changes may be experimen- 
tally accomplished. So far as is now 
known, perseveration of effects seem to 
last, at most, 6 weeks (Bexton, Heron, 
& Scott, 1954), but, if at all, more likely 
a matter of hours or days (Arnhoff et al., 
1962; Cohen et al., 1958; Doane et al., 
1959; Heron et al., 1956; Lilly & Shur- 
ley, 1958; Mendelson & Foley, 1956; 
Vernon et al., 1956, 1957). 

A notable exception to the idea of 
cognitive or perceptual change being de- 
leterious and of short duration after iso- 
Jation is the claim of increased recep- 
tivity to the benefits of psychotherapy 
and concomitant improvement of psy- 
chotic patients who were exposed to con- 
ditions approximating the McGill study 
(Azima & Cramer, 1956; Harris, 
1959). The efficacy of the isolation 
procedure in this case, and its relation- 
ship to the improvement or facilitation 
of recovery from a psychotic state may 
be questionable for the obvious reason 
that no conclusive evidence that any 
change observed was the direct conse- 
quence of the isolation. Practical ex- 
perience in the clinical setting, however, 
has frequently led to the use of restraint 
or isolation of some types of agitated pa- 
tients. The results of restraint or isola- 
tion would suggest that such procedures 
tend to reduce confusing internal per- 
ceptive and cognitive activity, bringing 
about a state of relative calm, with in- 
creased responsiveness to the ensuing 
stimulation of psychotherapy- Tt is in- 
teresting to speculate that a disturbed 
individual may very well become almost 
“normal” in his response to isolation by 
becoming more tractable and receptive 
to measures which are known to be ef- 
fective with more normal people. Until 
we know more about sensory and percep- 
tual isolation, however, such a notion 
must remain within the realm of specu- 
lation only. 


INDIVIDUAL DIFFERENCES 


That individual responses may differ 
according to personality differences has 
been amply demonstrated by the fact 
that some subjects in isolation show one 
kind of response, for example, hallucina- 
tions, disorganization, performance defi- 
cits, and others show an entirely different 
kind of response, for example, no hallu- 
cinations, little, if any, disorganization, 
and, perhaps, some facilitation of per- 
formance. Not all the McGill subjects 
reacted to decreased variation with dis- 
comfort or annoyance, nor did Cohen 
et al. (1960a, 1960b) report similar uni- 
form reactions with their subjects at 
Duke. Twenty-nine people began the 
McGill study, while only 18 were suffi- 
ciently “comfortable” enough to finish; 
there must have been both personality 
and motivational differences between 
those who did and those who did not 
finish. Similar influences have been 
drawn in almost all other accounts. Sil- 
yerman and his associates (1958) re- 
ported that, in a pilot study, individuals 
prediagnosed as schizoid personalities 
were more content and even liked the 
situation more than normal subjects; 
body-oriented subjects were less dis- 
turbed than field-dependent ones, and 
sex-role differences were manifested in 
projective responses. These same in- 
vestigators gave anecdotal evidence of 
specific ego involved behavior; subjects 
projectively perceived common objects 
in their studies or employment in a 
single, accidental, fleeting exposure to a 
crack of light, for example, a psychology 
student saw a Rorschach card, a medical 
student saw an X ray, etc. (Silverman 
et al., 1958). The work of Azima and 
Cramer (1956) and Harris (1959) lends 
credence and some suggestion of validity 
to these observations in the sense that 
schizophrenics can be considered as a 
specific class of personality syndrome 1 
that they display commonly similar 


types of responses to sensory isolation, aS 
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r the normal subjects thus 
Pan en studies, though most of 
the latter studies have tended to use un- 
classified, unevaluated volunteers or 
“warm bodies” for subjects. 
Interestingly enough, it has been Te- 
lated (Parry, 1960) that Soviet astro- 
nauts (“cosmonauts”), after being ex- 
posed to isolation and weightlessness as 
part of their preparatory training for 
space flight, are said to have enjoyed this 
condition so much that all of them con- 
fessed that they could think of no better 
way to spend a vacation. It may very 
well be that schizoid personalities, as 
reported by the Duke group, and as 
borne out by the clinical studies men- 
tioned above, might make the best space 
pilots; if being body oriented and con- 
crete minded are the most suitable pre- 
requisites for survival in space, then such 
seemingly atypical personalities might be 
better equipped to do so. Personality 
differences may account for differential 
motivation in volunteering for space- 
flight projects, but it is equally con- 
ceivable that motivation can produce the 
particular personality which finds need 
fulfillment in such ventures. Such con- 
siderations have opened whole new ave- 
nues of research, not only concerned with 
space travel per se, but intimately re- 
lated to the entire field of personality 
dynamics and motivation. 

Another way in which sensory and 
perceptual isolation studies are related to 
the problems of individual differences is 
the widely accepted notion, very fre- 
quently utilized in other kinds of studies, 
of need or deficit. Just as deprivation 
of nourishment results in increased actiy- 
ity, which functionally produces coordi- 
nated motor responses designed to place 
the organism in a more favorable situa- 

tion to acquire nourishment, if it is to 
survive, stimulus deficit also produces 
behavior designed to acquire stimulation, 
if, as Hebb (1955) suggests, it is to sur- 
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vive. Nutrient qualities of stimulation 
could be responsible, in large measure, 
for the need for stimulation, or stimulus- 
seeking behavior manifested by subjects 
who hallucinate, report discomfort, or 
actively attempt to produce situations 
which will ultimately lead to some form 
of real or hallucinated sensory-gratifica- 
tion; the perceptual and cognitive 
changes noted during and after isolation 
may be by-products or artifacts of de- 
ficiency, just as such changes are ob- 
served in subjects undergoing semistar- 
vation (Franklin, Schiele, Brozek, & 
Keys, 1948). It appears quite possible 
that one can be “starved” for stimula- 
tion, as well as for food, and the re- 
sponses associated with such a state seem 
at least dynamically similar to other 
kinds of starvation. Exploratory behav- 
ior or curiosity might be thought of as 
a kind of “snack” with novel stimuli 
for its substance, just as a fully food- 
satiated individual might partake of an 
extra delicacy or two between meals. 
Not being able to obtain food, or stimu- 
lation, because it has been irretrievably 
withdrawn, it then becomes an extremely 
important prerequisite for integrated, in- 
telligent, adaptive behavior. Motivation, 
then, is quite prominent and relevant in 
the analysis of responses to sensory de- 
Privation, invariant patterning, over- 
stimulation, and monotony. 
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THREE THEORETICAL INTERPRETATIONS OF 


ORDER EFFECTS 


IN PERSUASIVE 


COMMUNICATIONS: 


ROBERT E. LANA 
Alfred University 


The “set,” “reinforcement,” and “sensory variation” hypotheses are re- 
viewed in terms of their ability to predict the results of order effects 
in the persuasive situation where opposed arguments on the same topic 
are utilized. The set hypothesis is most successful when the communica- 
tive materials presented are unfamiliar to the S. The sensory variation 
hypothesis is most successful when the topics of the communications are 


concerned with familiar social issues. 


Although interest in the position of 
opposed communications and their effec- 
tiveness in changing opinions was pres- 
ent at least as early as 1925 (Lund, 
1925), research activity was begun in 
earnest only recently. Carl Hovland and 
his associates (1957) produced a volume 
of research and theory which probably 
stirred other researchers to re-examine 
this problem area. A body of research 
on order effects as a result of persuasive 
communications has since developed. It 
is the purpose of this paper to review 
three major theories which attempt to 
explain the empirical results. Primacy 
refers to the success in changing opinion 
of the initial argument of two opposed 
communications, Recency refers to a 
similar success of the argument pre- 
sented second. 

One of the initial explanations of the 
influence of primacy when opposed com- 
munications are Successively presented 
to a subject is that suggested by Luchins 
(1958) and Anderson and Barrios 
(1961), having its immediate history in 
an experiment performed by Asch in 
1946. The concept utilized was that of 
“set” or Einstellung, much in the same 

sense in which the terms were used by 


1This study was supported by Research 
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of Mental Health, United States Public Health 
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nineteenth century psychologists, The 
fundamental idea is that a subject, con- 
fronted with unfamiliar material, will 
establish a tendency to react later in 
terms of the initial material rather than 
in terms of different material presented 
after it. Under this hypothesis, it must 
be assumed that no organization exists 
in the subject with respect to the com- 
munication presented to him, The ini- 
tially unfamiliar material supplies such 
an organization which influences his sub- 
sequent responses (opinion expression 
via questionnaire). Thus, presentation 
of some communication, which supports 
a given opinion or establishes a particu- 
lar description, will render succeeding 
material relatively ineffective in chang- 
ing opinion compared with the first 
presentation. This effect should be maxi- 
mal the greater the temporal contiguity 
of the two presentations since forgetting, 
and interference by external factors, 
would be minimal. This hypothesis de- 
pends heavily upon the condition that 
the presentation of a topic with which 
the subject is unfamiliar be truly estab- 
lished, since any previous contact by the 
subject with the material would consti- 


tute violation of this condition. It thus 


becomes necessary in studies of opinion 


change to devise topical material which 
1S as unique as possible for the subject. 
Current popular social issues are, there- 
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fore, not suitable for use as topics of 
communications. The first elements of 
the communication regarding some 
social issue may simply aid the recall of 
already-formed opinions about the mat- 
ter, and thus the set interpretation will 
not be applicable. The possibility, how- 
ever, exists that a set may be formed in 
the given experimental situation in spite 
of any recall which might be induced in 
the subject by the initial communica- 
tion, An examination of the empirical 
evidence bearing on this point will be 
made below. 

Luchins (1957a, 1957b) presented 
two different descriptions of the same 
individual to high school and college 
students. These descriptions were char- 
acterized by being polar in nature on 
an introversion-extraversion continuum. 
Hence, they were essentially opposed 
communications. Complete lack of fa- 
miliarity with the subject matter was 
assured since the individual who was be- 
ing described (“Jim”) was fictitious. 
The subjects were then asked to describe 
Jim in their own words and answer 
questions about his personality charac- 
teristics. The presentation of the op- 
posed descriptions was juxtaposed for 
the various experimental and control 
groups. Luchins found consistent pri- 
macy effects. He favored a set explana- 
tion of his results, in that the initial 
description directed the later opinions of 
the subjects in much the same way that 
a particular initial solution to 4 problem 
affects attempts to solve later similar 
problems. 

The effects on opinion change of these 
initial impressions may be minimized by 
interposing, throughout the experimental 
process, directions which change the set. 
For example, Luchins (19574, 1957b) 
interposed comments such as, “I want 
each of you [subjects] to suspend judg- 
ment of the individual [“Jim”] about 
whom you are to read until you have 
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completely finished reading all that is 
written about him... .” With com- 
ments such as these, primacy effects 
were reduced in magnitude and, depend- 
ing upon how late in the procedure the 
comment or text was introduced, recency 
effects were sometimes obtained. Luchins 
(1958) also found a recency effect when 
a questionnaire was administered after 
each of two opposed communications 
had been delivered and the topic in- 
cluded different descriptions of the same 
individual. These results were opposite 
to the results of Lund (1925) who found 
primacy effects under the same test con- 
ditions. However, Luchins’ communica- 
tions were markedly different from 
Lund’s. Hovland and Mandell (1957) 
could not replicate Lund’s results. The 
effects of differences in the communica- 
tive materials will be discussed below. 
Thus, Luchins’ results indicate consist- 
ent primacy effects which would be pre- 
dicted by a set theory. 

Anderson and Barrios (1961) have 
also reported experiments which tend to 
support the set hypothesis. They ex- 
amined order effects over a sequence of 
communications on separate issues. Per- 
sonality adjectives were used as the 
communications. Both Asch (1946) and 
Luchins (1958) predict primacy effects 
for initial trials and recency effects for 
later trials. The subjects were instructed 
to attempt to form an impression of an 
imaginary person who was being de- 
scribed by a series of six adjectives, all 
relating to his personality. Favorable 
and unfavorable adjectives were pre- 
sented in various sequences to the ex- 
perimental groups and significant pri- 
macy effects were observed in all rop 
That is, the adjectives presented frst 
were more influential than those pre 
sented later. Also, there was & signifi- 

REE. effect over trials 
cant decline in primacy a 
i f various adjective see 
of presentation © d that when 
tets. The authors conclude 
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only two sets of adjectives (communica- 
tions) are used (as in other experiments 
and their own Experiment II) the true 
order effect is obscured. Hence, several 
communications are necessary for a pri- 
macy effect to develop. Primacy, how- 
ever, decreases over communication sets. 
This decrease in primacy is explained by 
the authors either in terms of a progres- 
sive loss in interest on the part of the 
subject in the task, or an adaptation 
process whereby previous material is in- 
tegrated by the subject with practice; or 
it may be that with increasing exposure 
a set develops to take increasing cog- 
nizance of all the words in the set, thus 
destroying any order effect that might 
have been originally present in the first 
few trials, There are several experiments 
where the set hypothesis does not pre- 
dict the results obtained. This may be 
because the communicative situation in 
such experiments does not contain the 
essential conditions necessary for cor- 
rect predictions from a set hypothesis. 
Studies have appeared in the litera- 
ture which utilize a different procedure 
from that of Luchins and Anderson and 
Barrios. In the studies by Hovland and 
Mandell (1957), Lana (1961) , and 
Thomas, Webb, and Tweedie (1961), 
the communications consisted of prose 
passages on topics of greater or lesser 
importance involving actual social issues 
(atomic submarines, cancer research, 
nuclear weapons, Picasso, etc.). Pretests 
and posttests, usually consisting of iden- 
tical questionnaires, were also used, 
These studies did not present the sub- 
jects with initial set-producing com- 
munications as in the Luchins and 
Anderson and Barrios studies since the 
communications themselves pertained to 
topics with which, it is safe to assume, 
the subjects had had some previous con- 
tact, Also, the fact of the pretest would 
serve, by its content, as a device aiding 
recall of information about the topic. 


In a series of studies by Lana (1961, 
1962) and his associates a consistent 
primacy effect was found when familiar- 
ity with a topic was high and a recency 
effect was evident when familiarity was 
low. 

In one study (Lana, 1961) familiarity 
was manipulated by differentially pro- 
viding information about an unfamiliar 
topic to two groups, while a third group 
received no such information. The “very 
familiar” group yielded a primacy effect 
and the “unfamiliar” group a recency 
effect. The middle “familiar” group 
yielded no significant order effect. 
Rosnow (1962) successfully replicated 
Lana’s results. In this instance existing 
familiarity with the topic was measured 
by a pretest. 

Thomas, Webb, and Tweedie (1961) 
performed three experiments to examine 
the effects of familiarity with a contro- 
versial issue on opinion change follow- 
ing two opposed successive arguments. 
The authors argued that a primacy ef- 
fect can only be expected to appear 
when the topic of communication is first 
exposed to the subject in the experimen- 
tal situation. Subjects were made fa- 
miliar with certain facts associated with 
the communications on the use of cancer 
serum while another group was not 
familiarized with this topic. The usual 
orders of presentation were used for the 
various groups. There was no difference 
between the familiar and the unfamiliar 
groups with respect to primacy. The 
second and third experiments utilized 
delayed posttests, There was no signifi- 
cant primacy effect in either the familiar 
or the unfamiliar groups. Although not 
specifically reported, presumably no sig- 
nificant recency effects were present in 
any of the studies. Their results con- 
firm ours in the experiments mentioned 
above on familiarity. There is no pri- 
macy effect evident with unfamiliat 
groups. Actually, in the studies by Lana 
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and Rosnow a recency effect is in greater 
evidence. One of the differences between 
the Thomas, Webb, and Tweedie studies 
and the Lana experiment is that the de- 
gree of controversiality of the topics is 
probably not the same in both sets of 
procedures. A more controversial topic 
was utilized in the Lana experiment than 
in the Thomas, Webb, and Tweedie 
study. This conclusion seems to be war- 
ranted by the fact that Lana’s subjects 
(1962), in another context, rated cancer 
research (used by Thomas, Webb, & 
Tweedie, 1961) to be of medium con- 
troversiality compared to nuclear weap- 
ons, Khrushchev, etc. However, Thomas, 
Webb, and Tweedie showed that the 
topic of use of cancer serum was per- 
ceived to be controversial by the sub- 
jects. These results are superficially 
opposite to those predicted from the set 
hypothesis, From the set hypothesis it 
follows that the more familiar an indi- 
vidual is with a given topic the less 
likely he is to establish a set on the com- 
munication presented first which should 
result in a recency effect. Conversely, 
the less familiar an individual is with a 
topic the more likely he is to establish 
a set with respect to the first communi- 
cation thus yielding 4 primacy effect. 
However, since the topics used by Lana 
and Thomas, Webb, and Tweedie were 
already familiar to college subjects, some 
familiarity already existed even for 
those groups termed unfamiliar in the 
various experiments. 

Other studies by Lana (1962) and 
Schultz (1963b) where controversy of 
and interest in the topic and awareness 
of the intent of the experimenter by the 
subject were manipulated, yielded per- 
plexing results. If it may be assume 
that “controversy,” “interest,” and 
“awareness”? may all be related to the 
formation of set in the communication 
situation, then this hypothesis is ap- 
parently unable to predict the empirical 


results. Schultz (1963b) found that 
groups that had been made strongly 
aware of the intent of the communica- 
tor produced no order effect when con- 
fronted with opposed communications. 
A moderately-aware group yielded a 
primacy effect, and a recency effect was 
present in a group of low awareness. 
According to the set hypothesis, the low- 
aware group should have yielded the 
primacy effect and the high-aware group 
the recency effect. Lana (1962) showed 
that primacy appeared in groups ex- 
posed to a very controversial topic and 
where there was great interest in the 
topic. The set hypothesis would have 
predicted recency effects for these groups 
or no effect at all. At this point it is 
perhaps better to conclude that the set 
hypothesis is not structured to explain 
order effects when communications con- 
cerning familiar social problems are 
involved, rather than to conclude that it 
is made tenuous by such studies. 

A “reinforcement” or “conditioning” 
explanation of order effects has been 
utilized in certain contexts by McGuire 
(1957) and by Rosnow (1962). The 
prediction of a particular order effect 
depends upon the ability of the experi- 
menter to create or locate a rewarding or 
punishing group of stimuli in the com- 
munication situation. It is problematic 
whether or not reward or punishment 
as reinforcers can be located as factors 
within the experiments described above. 
An individual may be convinced of an 
argument to either increase Or decrease 
support of, for example, cancer research 
in the face of the opposite position with- 
out a reward or a punishment being con- 
tingent on his taking st ae 

Pat McGuire (1957) 4 conditioning 
trial” occurs when a SU 
the communication ii T ee 
and this agreemen| is 

j inforcement of the ag 
there iDO m Į” has occurred- 


ment, an “extinction tria 
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It was necessary to establish that the 
topics had either pleasant or unpleasant 
contingencies involved with a belief in 
any of them. McGuire did this by ad- 
ministering appropriate questionnaires. 
He demonstrated that “the effect of a 
source’s earlier communications on the 
persuasiveness of his later messages de- 
pends in large measure on the extent 
to which agreement with those earlier 
communications was rewarding for the 
recipient.” Reward consisted of the 
situation where results previously judged 
to be desirable by the subjects (e.g., 
federal aid to education) were regarded 
as likely to occur by the communicator, 
It is to be noted that McGuire’s experi- 
ment dealt with order effects within a 
communication rather than with order 
effects concerning opposed communica- 
tions as used in the above studies. How- 
ever, Rosnow’s (1962) experiment was 
Concerned with successive opposed argu- 
ments. He created a punishing situa- 
tion, the relief from which he character- 
ized as reinforcement. Anxiety was 
created among high school students by 
administering an unannounced quiz in 
their Psychology class. Two days later 
Pro and con arguments concerning civil 
defense were Presented either before or 
after anxiety about quiz results had been 
reduced by informing the students that 
their grades would not count on their 
records. Throughout the study, Opinion 
questionnaires were administered as well 
as a mood questionnaire to check the 


effects of the anxiety-producing situa- 
tion. 


Rosnow’s principal hypotheses 
were that reinforcement occurring before 


the presentation of opposed arguments 
would yield a primacy effect, and that 
reinforcement occurring after the pres- 
entation of opposed arguments would 
yield a recency effect. These predictions 
were made on the supposition that con- 
tiguity between the reinforcement and 
either the first- or second-presented argu- 


ment would strengthen the tendency to 
respond in that direction. The results 
indicated that, although anxiety had 
been successfully created, there were no 
consistent order effects resulting from 
reinforcement (anxiety reduction) be- 
fore or after the presentation of the argu- 
ments. Since no clear-cut effects were 
present, no further information concern- 
ing a reinforcement interpretation of 
order effects is available at this time. 

The most recent attempt to explain 
order effects in persuasive communica- 
tions has been made by Schultz (1963a). 
Schultz developed four postulates and 
two corollaries from a physiological “sen- 
sory-variation” hypothesis based on the 
work of Hebb (1955), Scott (1957), 
Fiske and Maddi (1961), Lindsley 
(1957), and Malmo (1959). The princi- 
pal idea is that human beings are dis- 
posed to seek high activation and, hence, 
to react more strongly to novel stimuli 
than to stimuli with which they are 
familiar, since novel stimuli provide 
higher cortical activation, other things 
being equal. We need not be detained 
by this point (for details see Schultz, 
1963a). From this general point of view 
Schultz derives the four following hy- 
potheses: 

1. When an individual is exposed to a 
topic for the first time (as in Luchins, 
Asch, Anderson and Barrios, called 
“true” primacy studies by Schultz) the 
first communication is perceived as novel 
and mediates increased cortical activa- 
tion. A second (opposed) communica- 
tion on the same topic provides little 
activation compared with the first com- 
munication and, hence, in order to main- 
tain activity, the individual reacts in ac- 
cordance with the position advocated by 
the initial communication. A primacy 
effect results. 

2. If the subject is required to register 
his opinions after each (pro and con) 
communication, then, in order to main- 
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tain the level of activation induced by 
the first communication, he will respond 
on the second questionnaire in opposi- 
tion to his response on the first question- 
naire. A recency effect will result. 

3. Time or task interpolation between 
first and second presentations of the op- 
posed communications serve to dissipate 
activation level induced by the first com- 
munication and, therefore, the second 
communication will be more effective and 
a recency effect or no effect at all will 
occur. The second communication has 
the characteristics of a novel stimulus 
under these conditions. 

4. When studies utilize communica- 
tions which are not initial (as when they 
are about current social issues, such as 
integration, aid to education, etc.; called 
“experimentally induced primacy stud- 
ies” by Schultz) then the first communi- 
cation will not produce as high an activa- 
tion level as if it were an initial exposure. 
The first communication may still be 
somewhat novel, however, and may de- 
termine some increased activation and 
attention. Hence, a less pronounced 
primacy effect (as compared with true 
primacy studies) or no effect at all is 
predicted. 

Schultz (1963a) then appends two 
collaries (a and b) to his fourth postu- 
late. According to @, if, under the condi- 
tions of 4 above, the social issue has an 
intense stimulus value along dimensions 
such as familiarity, interest, or con- 
troversiality, then the dynamics of the 
communication situation parallels Postu- 
late 1 above and a more pronounced 
primacy effect should occur. Schultz 
makes the implicit assumption that di- 
mensions such as interest in the topic 
and degree of controversy as perceived 
by the subject have the same systematic 
status as familiarity of the subject with 
the topic, which is the dimension in- 
volved in the true primacy studies. 
Corollary b states that in the situation 


described in 4a a recency effect should 
occur when a questionnaire is inter- 
polated between the two communica- 
tions. 

Since Schultz makes the same predic- 
tions as set theorists (derived from dif- 
ferent initial assumptions) for his true 
primacy studies his success is equal to 
theirs. 

Difficulty arises with Schultz’s predic- 
tions 4, 4a, and 4b in studies which are 
not “initial” in the sense used through- 
out this paper. 

The earliest study cited in this con- 
text is the one by Lund in 1925. Schultz 
(1963a) predicts a recency effect by 
Postulate 2, but Lund reported a primacy 
effect with questionnaires presented after 
each communication (pro and con). 
Cromwell (1950), using a single ques- 
tionnaire after the presentation of both 
arguments, indicated a recency effect was 
present, but Schultz predicts primacy. 
Hovland and Mandell (1957) replicated 
Lund’s study using a different topic and 
their results are also inconsistent with 
Schultz’s predictions. The success of his 
model with induced primacy studies in- 
creases somewhat when applied to some 
of the results of Lana (1961, 1962). 
From Shultz’s Postulates 4 and 4a, 
primacy is predicted for groups with 
high familiarity with the topic and when 
the topic is very controversial. However, 
Schultz does not report that, where his 
Postulates 4 and 4a predict the results 
for familiarity and controversy, they do 
not adequately handle Lana’s results 
(1962) in an experiment where high- 
interest groups yielded no effect and ia 
medium-interest group yielded a Pre 
macy effect. Schultz must predict a 
macy for the high-interest groups an 
no effects or recency for the nerd 
interest groups. In another study ( TE 
1962) high controversy, in pce Si 
with method of pretesting, yae ale s 
der effect and medium contr y 
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yielded a significant primacy effect. 
Schultz simply cannot „predict all of 
Lana’s results as he implies in his paper. 
However, of the three theories reviewed, 
it seems clear that the sensory-variation 
hypothesis comes closest to predicting 
order effects where the topics represent 
current social issues. 
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The rate o: 


f acquisition of discrimination learning set in rhesus monkeys 


is related to amount of within-problem learning, problem difficulty, and 


amount of separati 
naive monkeys have 


‘on between stimuli and response loci. Experimentally 
a weak tendency to approach novel stimuli, but 


# because of generalization from the positive to the negative stimulus within 
a problem, experimentally sophisticated monkeys tend to approach famil- 
iar negative stimuli rather than new stimuli. Early in training there are 


more errors following a first-trial error th 


-trial response, but later this trend is reversed. All of the data can be 


accounted for by Hull-Spence theory, 


tions added. 


This article has two main purposes: 
first, to review the research on discrim- 
ination learning set in rhesus monkeys, 
and second, to outline a theoretical anal- 
ysis of learning set based on Hull-Spence 
learning theory. 

The few studies of learning-set forma- 
tion that used infrahuman primates other 
than rhesus monkeys as subjects are ex- 
cluded because the interpretation of the 
obtained species differences is compli- 
cated by the use of different experimen- 
tal procedures (see Harlow, 1959) .(Data 
of irradiated monkeys and monkeys with 
brain Jesions have also been excluded.) 
The review is limited to discrimination 
learning set (Harlow, 1949) because it 
has been more thoroughly investigated 
than the other kinds of learning set, such 
as oddity learning set (e.g, Meyer & 
Harlow, 1949; Moon & Harlow, 1955) 
and reversal learning set (€-8+ Harlow, 
1949; Meyer, 1951), and in spite of the 

1The writer is indebted to several col- 
leagues, especially B. R- Bugelski, Jobn W. 


Davenport, and Billey Levinson, who read and 
criticized earlier versions of this paper- 


an following a correct first- 


with a small number of assump- 


use of many different experimental pro- 
cedures, a “standard” technique for es- 
tablishing a discrimination learning set 
(hereafter called “Jearning set”) can be 
described. In the standard learning-set 
technique, the subject is given a series of 
problems that have a common basis of 
solution; each problem involves: (a) 
simultaneous discrimination between two 
stimulus objects, with a different pair of 
stimuli for each problem; (b) blind bait- 
ing of the correct stimulus; (c) reward 
for every correct response; (d) a non- 
correctional procedure; and (e) some 
small, fixed number of trials (on each 
problem). Improvement from problem 
to problem in the performance level on 
a given within-problem trial, usually 
Trial 2, is used as a measure of inter- 
problem transfer or learning-set forma- 


tion. 
Current analyses of lear! 
deal with “error factors” (Harlow, 1949, 


1950, 1959), “hypotheses” (Levine, 
1959), or “strategies” (Bowman, 1961). 
These are response-defined terms, refer- 
ring to systematic sequences of responses, 
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ning-set data 
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as did Krechevsky’s (1932) term “hy- 
pothesis.” Naming the response se- 
quences does not explain them, however, 
and no mechanism to account for the 
development and operation of strategies, 
hypotheses, or error factors has been 
suggested, although several theorists 
have dealt with the problem (e.g., Har- 
low, 1950, 1959; Harlow & Hicks, 1957; 
Levine, 1959; Restle, 1958, 1960, 1962). 
There have been suggestions in the re- 
search literature on learning sets in mon- 
keys that various findings can be ac- 
counted for by traditional learning 
theory. Harlow (1950), for example, 
noted that Hull’s (1943) concept of re- 
active inhibition could account for one 
of the error factors (“response-shift”) 
[although he later rejected the notion 
(Harlow, 1959)], and Riopelle, Cron- 


a and Addison (1962) concluded 
that: 


‘ Ors in gradual learning-set for- 
mation ... are to be found in cue stimuli and 


the interproblem relations rather than in the 
acquisition of supportin, 


tion of antagonistic habits Cp. 277]. 


or negative earlier in training, 

The notion that an analysis may be 
based on traditional learning theory is 
not new (see Bugelski, 1956), but no 
such analysis has Previously been at- 
tempted. 


REVIEW OF THE LITERATURE 


The writer has reinterpreted the data 
of many of the studies, requiring that the 
experimental designs be presented in 
more detail than is customary in reviews. 
Especially in the section on Reactions to 


Novel Stimuli, the new interpretations 
yield conclusions that were not originally 
apparent, 


Experimental Variables 


Number of Trials ber Problem. Ac- 


cording to Levine, Levinson, and Harlow 


(1959) the rate of learning-set formation 
is a function of the number of trials pre- 
sented, regardless of how these trials are 
organized into problems, provided the 
number of trials per problem is between 
3 and some number greater than 12 but 
less than 50. When more trials than the 
upper limit are given on each problem, 
the rate of learning-set formation is re- 
tarded. Levine, Harlow, and Pontrelli 
(1961) obtained similar results, but var- 
ied the number of trials per problem 
within subjects rather than between sub- 
jects. These results are obtained by 
plotting performance against the number 
of trials, not the number of problems: 
the number of Problems required to 
reach any specified Performance level is 
reduced when the number of trials per 
problem is increased, within the specified 
limits, 

Kind of Stimuli, Learning-set forma- 
tion is faster when “stereometric” objects 


Problems usin 
lems using 


from stereometric to patt 


“a 
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Contiguity of Stimulus and Locus of 
Response. McDowell and Brown (1960) 
gave 234 “peripheral-cue” problems 
(with about a 3-inch separation between 
stimuli and response loci), and although 
there was significant within-problem im- 
provement, there was no significant in- 


“terproblem improvement. The subjects 


had had previous experience with stand- 
ard learning-set problems, but appar- 
ently could not acquire a peripheral-cue 
learning set. Murphy and Miller (1955) 
gave one group of subjects standard 
learning-set problems and another group 
a kind of peripheral-cue problem in 
which the stimuli were located 6 inches 
above the manipulanda. The peripheral- 
cue group did not acquire a learning set, 
although they were given more than 600 
problems; but the standard group ac- 
quired a learning set. After 576 prob- 
lems the standard group was switched to 
the peripheral-cue problems, and the 
learning set disappeared; performance 
dropped to the level of the peripheral-cue 
group. 

Riopelle, Wunderlich, and Francisco 
(1958) used pattern stimuli consisting 
of various numbers of concentric rings, 
with the relevant cue located in the cen- 
ter or in one of the rings. The perform- 
ance level was higher, the closer the cue 
was to the perimeter; and although the 
number of rings (inversely related to 
size of stimuli) was a significant variable, 
there was little effect of the number of 
tings when the cue was on the perimeter. 
Warren (1963c) obtained similar results. 
He varied the positions of pattern stimuli 
on background cards and found the high- 
est level of performance when the stimuli 
occupied the borders of the cards, the 
worst performance when the stimuli oc- 
cupied the centers of the cards, and in- 
termediate performance when the stimuli 
occupied the center and part of the bor- 
der. The subjects in these studies prob- 
ably displaced the stimulus cards by 
touching the edges of the cards, so that 


the stimulus and locus of response were 
contiguous on the problems in which the 
cues were on the perimeters of the cards. 
Schuck, Polidora, McConnell, and Meyer 
(1961) found performance on center-cue 
problems superior to performance on bor- 
der-cue problems; in their study the re- 
sponse units were centered on the stimuli 
(see also McConnell & Schuck, 1962). 

In general, discrimination learning is 
easier when the stimuli and locus of re- 
sponse (reaching for food) are spatially 
contiguous than when they are separated 
(McClearn & Harlow, 1954; Meyer, 
Polidora, & McConnell, 1961; Murphy 
& Miller, 1958; Schuck, 1960; Stollnitz 
& Schrier, 1962) and Otteson, Sheridan, 
and Meyer (1962) have concluded that: 
monkeys, if given opportunities to do so, 
monitor their fingertips so closely that they 
fail to sample broadly from discriminanda. . . - 
Spatial contiguity becomes important only 
when the sampling process is affected [p. 937]. 

Size of Stimuli, Warren (1953b, 
1953c) and Blazek and Harlow (1955) 
varied the size of pattern stimuli, by 
covering various percentages of the cen- 
ters of the background cards, and found 
that increasing the area covered in- 
creased the performance level. Accord- 
ing to Schuck et al. (1961), the data of 
such studies can be interpreted in terms 
of variation of the separation between 
stimulus and response loci. The greater 
the percentage of the card covered by 
the stimulus, the closer the stimulus is to 
the locus of response. 

Two studies discussed above (Riopelle 
et al., 1958; Warren, 1953c) varied both 
the area and the position of the stimuli, 
and found significant effects of both 
variables. If increasing the area only 
reduces the separation between the stim- 
ulus and response locus, an interaction 
between area and position should be ob- 
tained; on the border-cue problems there 
is no separation, and therefore a 
should have no effect on performance; 
but on the center-cue problems, increas- 
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ing the area should reduce the separation 
between the stimulus and response locus, 
and therefore increase the level of per- 
formance. The required interaction was 
obtained in both studies, but was sta- 
tistically significant only in the study by 
Riopelle et al. The required simple 
effects very closely approximated the 
data of Riopelle et al., but described 
Warren’s data less well. Warren’s curve 
for the border-cue position was not flat, 
but the rise was less than in the center- 
cue curve. 


Performance Variables 


Retention of Learned Associations. 
Several studies have shown remarkably 
high retention of responses to specific 
stimuli, In a study of learning-set de- 
velopment in infant monkeys, Mason, 
Blazek, and Harlow (1956) unintention- 
ally repeated a series of 90 six-trial prob- 
lems after an interval of about 1 month, 
and found more than chance correct 
Trial-1 in the repetition, 
Strong (1959) deliberately tested reten- 


d nega- 
w stim- 
to 67% of the responses 
were correct, but the relatively low level 


of retention may be attributable to the 
kind of retention test used (see follow- 
ing section on Reactions to Novel 
Stimuli). 

Reactions to Novel Stimuli. Experi- 
mentally naive monkeys tend to ap- 
proach new stimuli and experimentally 
sophisticated monkeys tend to approach 
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familiar stimuli. Riopelle et al. (1962) 
gave experimentally naive monkeys four 
standard problems daily, and six daily 
problems each with one recurrent stimu- 
lus and one new stimulus. In the group 
of major relevance here, the six recur- 
rent stimuli were positive on some prob-, 
lems and negative on other problems, 
the reward value changing randomly 
from day to day. This group exhibited 
a weak tendency to respond to the new 
stimuli on the first trial of the recurrent- 
stimulus problems for the first 20 days 
of the experiment, but eventually de- 
veloped a tendency to respond to the 
familiar (recurrent) stimuli, although 
after 500 problems the preference for the 
familiar stimuli appeared to decline. 

Fitzgerald and Davis (1960) gave a 
test trial on each problem, presenting 
either the positive or the negative stimu- 
lus with three new stimuli. When the 
positive stimulus was presented on test 
trials, all subjects tended to respond 
correctly. When the negative stimulus 
was presented on test trials, experi- 
mentally naive monkeys tended to re- 
spond to the position last occupied by 
the positive stimulus, and experimentally 
sophisticated monkeys responded most 
frequently to the negative stimulus. Ex- 
perimentally sophisticated subjects, 
then, responded to the familiar, negative 
stimuli, but experimentally naive sub- 
jects did not, 

Leary (1958) gave “ambiguous-cue” 
problems to subjects that had had ex- 
perience with object discriminations and 
other kinds of problems for more than 
3 years. The positive stimulus (A) and 
the “ambiguous” stimulus (C) were pre- 
sented on the first trial (AC trial) of 
each ambiguous-cue problem, and the 
negative stimulus (B) and the ambigu- 
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stimulus. There were many more errors 
on the AC trials (both stimuli new) 
than on the BC trials (B new, C cor- 
rect). Furthermore, on transfer prob- 
lems subjects initially tended to respond 
to Stimulus C, avoiding a new stimulus 
(D). These experimentally sophisti- 
cated subjects, then, tended to approach 
familiar stimuli. 

Table 1 gives the designs and relevant 
data of five additional studies. As shown 
in Table 1, Riopelle (1955) gave five 
kinds of problems: (2) Positive-Positive 
—positive stimulus of a problem retained 
as positive stimulus in the next problem, 
(b) Negative-Negative—negative stimu- 
lus retained as negative, (c) Positive- 
Negative—positive stimulus retained as 
negative, (d) Negative-Positive—nega- 
tive stimulus retained as positive (the 
reversals never occurred without at least 
one intervening problem), and (e) both 
stimuli new. The subjects were experi- 
mentally naive. 

Performance on the Negative-Nega- 
tive condition was superior to perform- 
ance on the Positive-Positive condition, 
as indicated in Table 1, suggesting a 
tendency to approach the new stimulus. 
On the Negative-Negative condition the 
tendency to avoid the old negative stimu- 
lus and the assumed tendency of experi- 
mentally naive subjects to approach the 
novel stimulus are both correct; but on 
the Positive-Positive condition the tend- 
ency to approach the novel stimulus 
competes with the tendency to approach 
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the positive stimulus. The superiority of 
the Positive-Positive condition to the 
condition on which both stimuli were 
new indicates that the tendency to ap- 
proach the old positive stimulus was 
stronger than the tendency to approach 
the new stimulus; on both conditions two 
approach tendencies compete, but there 
must have been less competition on the 
Positive-Positive condition. The superi- 
ority of the Negative-Positive condition 
to the Positive-Negative condition indi- 
cates that the tendency to approach the 
old positive stimulus was greater than 
the tendency to avoid the old negative 
stimulus (the subjects in the Negative- 
Positive condition overcame the tend- 
ency to avoid the old negative stimulus 
faster than the subjects in the Positive- 
Negative condition overcame the tend- 


gi 
ency to approach the old positive stimu- 


lus). 
Brown, Overall, and Gentry (1958) 
gave Positive-Negative problems to one 
group, and Negative-Positive problems 
to another (reversals occurred without 
intervening problems). The subjects 
had had extensive laboratory experience 
including training on object-discrimina- 
tion problems. First-trial performance 
on the Negative-Positive problems was 
superior to first-trial performance on the 
Positive-Negative problems (see Table 
T: 
According to Brown et al. (1958), 
«First-trial solution is explicable in 
terms of the formation of a concept of 


TABLE 1 

Orper or Ease OF TRANSFER PROBLEMS FOLLOWING STANDARD DISCRIMINATION PROBLEMS 

ü =e Negtivee Positive Negative- Both New 
Experiment Positive Negative Negative ositive 
3 

A 5 4 
Riopelle (1955) 2 Es r 1 cs 
Brown et al. (1958) = 1 2 es = 
Brown et al. (1959) — 5 ae a = 
Leary (1956) 1 a E = 
Behar (1962) 1 2 

Jained in text. 


Note.—Order is from easiest 


to most difficult. Labels are exp 
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€ ? or ‘novelty’ and the association 
Pe A porse with the conceptual stimu- 
lus class [p. 705].” The response tend- 
ency was to avoid new stimuli, and it 
might be noted that if subjects tend to 
avoid new stimuli, they may be said to 
have a “concept” of newness. (The sub- 
jects have a tendency to “avoid” a 
stimulus if the excitatory potential of 
the stimulus is less than the excitatory 
potential of the other stimulus.) Mon- 
keys apparently can learn simple con- 
cepts (e.g., Hicks, 1956; Stone, 1961). 
In the Negative-Positive condition 
one of the response tendencies, the as- 
sumed tendency of sophisticated subjects 
to avoid the new stimulus, is correct. As 
training continues and learning set de- 
velops, within-problem learning improves 
So that the subjects make very few er- 
Tors and therefore acquire only a weak 
tendency to avoid the negative stimulus 
(as Brown et al., 1958, suggested). The 
Weak tendency to avoid the negative 
stimulus is incorrect on the reversal, but 
does not compete effectively with the 
stronger, correct tendency to avoid the 
new stimulus. The reversal, then, js 
achieved without error, 
rown et al, (1958) included another 
group that was trained first on the Nega- 
tive-Positive condition and then on the 
Positive-Negative condition, Perform- 
ance on the second condition was su- 
perior to performance on the first. This 
finding is predictable from the stimulus- 
response analysis, since the excitatory 
potentials of the stimuli would be more 
nearly equal at the beginning of the 
second training period than the first. 
Brown, Overall, and Blodgett (1959) 
gave Positive-Negative and Negative- 
Negative problems. All but 2 of the 33 
subjects had been used by Brown et al. 
(1958) and the other 2 also had exten- 
sive laboratory experience on a variety 
of problems. First-trial performance on 
the Negative-Negative problems was 


better than on the Positive-Negative 
problems. 

In a Negative-Negative problem one 
of the response tendencies, the tendency 
to avoid the negative stimulus, is cor- 
rect, and the problem should therefore be 
easier than the Positive-Negative re- 
versal, in which both response tenden- 
cies are incorrect. The development of 
first-trial solution of both kinds of prob- 
lem might indicate that a tendency to 
approach new stimuli gradually became 
stronger than the tendency to avoid new 
stimuli. (It might be noted that in learn- 
ing to approach a given new stimulus, the 
subject does not necessarily learn an 
approach response to the class of “new 
stimuli,” since experience makes a new 
stimulus familiar.) Alternatively the 
first-trial solutions could be explained 
by patterning. A pattern consisting of a 
familiar, positive stimulus and a new 
stimulus serves as a cue for a reversal; 
and a pattern consisting of a familiar, 
negative stimulus and a new stimulus 
does not. The assumption that the mon- 
key can learn to respond differentially 
to such cues is supported by Riopelle and 
Copelan’s (1954) finding that reversal 
can be stimulated by a change in back- 
ground color. 

Leary (1956) gave Positive-Positivė 
and Negative-Negative problems ‘to ex- 
perimentally sophisticated subjects and 
found more errors on the latter, indicat- 
ing that the subjects tended to avoid the 
new stimuli, 

Using subjects that had previously ac- 
quired a learning set (Behar, 1961b), 
Behar (1962) gave problems with 4, 7; 
or 10 discrimination trials followed by 
a test trial on which either the positive or 
the negative stimulus was presented with 
a new stimulus. On the Positive-Positive 


test trials, about 96% of the responses 


were to the positive stimulus: and on 
the Negative-Negative test trials, 44% 
of the responses were to the negative 
stimulus, in line with Leary’s (1956) 
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data, The tendency to avoid new stim- 
uli need not be strong, since the tend- 
ency to avoid the negative stimulus is 
probably weak in subjects that have ac- 
quired a learning set. Only 2.6% of the 
responses of Behar’s (1962) subjects on 
the discrimination trials (excluding Trial 
1) were to the negative stimulus. 
Additional data in Behar’s (1962) 
study suggest that there is generaliza- 
tion from the positive to the negative 
stimulus within a problem. On the Nega- 
tive-Negative test trials, the percentage 
of responses to the negative stimulus de- 
pended on whether or not errors occurred 
on the discrimination trials of the prob- 
lem, When no errors were made on the 
discrimination trials, 63% of the re- 
sponses were to the negative stimulus on 
test trials; but when one error or more 
occurred on the discrimination trials, 
the subjects responded to the negative 
stimulus on only 28% of the test trials. 
Furthermore, even when at least one 
error was made on the discrimination 
trials, the percentage of responses to the 
negative stimulus on test trials increased 
as the number of discrimination trials 
increased. After 4 discrimination trials, 
there were 237% responses to the nega- 
tive stimulus; after 7; 28%; and after 
10, 33%. As the tendency to approach 


the positive stimulus increased, with in- 
i jals, the generalized 


creasing training tr n ? 
tendency to approach the negative stim- 
ulus would increase, increasing the likeli- 
hood of responses to the negative stim- 
ulus on test trials. 
Sheridan, Horel, an 
data also support the 
each problem after 
sponded, either (a) 
sisted for 2 seconds, 
disappeared immediately, 
tive stimulus alone persisted, oF (d) the 
negative stimulus alone persisted. Per- 
formance was about the same on the first 
two conditions and inferior to perform- 
ance on the last two conditions. Perform- 


d Meyer’s (1962) 
conclusion. On 
the subject Te- 
both stimuli per- 
(b) both stimuli 
(c) the posi- 


ance was about the same on the last two 
conditions (38% versus 35% errors). 
Generalization between the positive and 
negative stimuli is, then, apparently re- 
duced when only one stimulus persists 
after the response, and therefore re- 
sponse competition is reduced and per- 
formance improves. (Mowrer’s theory, 
1960, Chapter 12, also accounts for 
these results. Generalization is not in- 
volved, but the probability of discrimi- 
nating between the positive and nega- 
tive stimuli is increased when only one 
stimulus persists, since it is “compared” 
with a stimulus trace, and learning is 
facilitated.) 

The data of five other studies, in- 
cluded in Table 2, indicate that naive 
subjects tend to approach novel stimuli, 
but Table 2 also includes six studies 
which indicate that experimentally 50- 
phisticated subjects tend to approach 
novel stimuli, In all of these studies the 
discrimination trials were preceded by 
presentations of single stimuli, which 
minimizes within-problem generaliza- 
tion. Since the previously discussed 
studies used procedures more like the 
standard learning-set technique, with 
simultaneous presentation of the posi- 
tive and negative stimuli (which en- 
hances within-problem generalization) , 
the development of a tendency to avoid 
novel stimuli must be related to within- 
problem generalization. Experimentally 
sophisticated subjects may retain a tend- 
ency to approach novel stimuli, but if 
generalization occurs from the positive 
to the negative stimulus within a prob- 
lem, the tendency to approach familiar, 
e stimuli becomes stronger than 
the tendency to approach novel stimuli, 

ts avoid novel stimuli. If 


and the subjects avole ; i 

generalization js minimized, the subjects 

approach novel stimuli. 7 i 
In the experimental conditions 1m- 


cluded in Table 2, each problem con- 
sisted of pretraining followed by dis- 
crimination trials with the pretraining 


negativ 
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TABLE 2 


F Ease oF DISCRIMINATION PROBLEMS WITH SINGLE-STIMULUS PRETRAINING 
ORDER O; 


Reward- 
Positive 


Experiment 


Nonreward- Nonreward- 


Negative 


Reward- 
Negative 


Warren and Brookshire (1959) 
i 1958) 
Lom: and Sinha (1959) 
Warren and Hall (1956) 
Blomquist and Harlow (1962) 
Brush et al. (1961) 
Mishkin et al. (1962) 
Harlow and Hicks (1957) 
Leary (1956) 
Moss and Harlow (1947) 
King and Harlow (1962) 
Group 75 
Group 50 
Group 25 


NNN N N e eee 


m 


2 


2 
2 
2 


Positive | 


1 
1 3 4 
1 1 2 
1 
1 
1 


Not 


stimulus and a new stimulus. In pre- 
training, the positive or negative stimu- 
lus of the discrimination trials was pre- 
sented alone, with or without reward, 
yielding the four conditions given in 
Table 2, 

After 200 Pretraining trials, Warren 
and Brookshire’s (1959) experimentally 
naive subjects made a mean of 6.0 er- 


Reward-Positive 


Negative problem. The tendency to ap- 
proach the new stimulus appeared to be 
almost as strong as the conditioned- 
approach tendency resulting from 200 
reinforcements, on the assumption that 
the subject attended to the stimulus on 
each pretraining trial, This assumption 
is, however, dubious, as indicated in the 
final section, 

Sinha (1958) obtained data Suggesting 
that there was an initial tendency to ap- 
proach new stimuli in subjects that had 
served in one previous experiment, Eight 
pretraining trials were given in (a) the 
Reward-Positive condition and (b) the 
Reward-Negative condition. Sinha also 
gave (c) four baited presentations each 
of the positive stimulus and the negative 


€.—Order is from easiest to most difficult, Labels are explained in text. 


stimulus (Condition AB), and (d) no 
pretraining (control), 

After 40 problems on each condition, 
the monkeys solved each kind of problem 
immediately (indicating that learning 
sets developed). Over the first 30 prob- 
lems, however, performance on the Re- 
ward-Positive condition was superior to 
Performance on the control condition, 
and performance on the control condi- 
tion was superior to performance on the 
other two conditions, Performance on 
Condition AB was superior to perform- 
ance on the Reward-Negative condition, 
but only over the first 10 problems. 
Sinha showed that the results are in line 
with Spence’s (1938) stimulus-response 
analysis of discrimination learning until 
the learning sets are acquired. Since 
two competing approach tendencies were 
involved on each condition, the order- 
ing must have resulted from differences 
in the amount of competition. The order- 
ing of the conditions is explained if the 
tendency to approach new stimuli was 
weaker than the approach tendency re- 
sulting from four reinforcements, 

_ Warren & Sinha (1959) obtained 
similar results, using experimentally 


| 


| 
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naive subjects and pattern stimuli 
(Sinha used stereometric objects). There 
was significant facilitation on Trial 1 
of the Reward-Positive condition, and 
significant interference in the Reward- 
Negative condition; performance on 
Condition AB and on the control condi- 
tion was not significantly different from 
chance. Warren and Hall (1956) also 
obtained essentially the same results, 
using subjects that were apparently rela- 
tively naive. 

Blomquist and Harlow (1962) gave 
experimentally naive subjects problems 
each with one pretraining trial and one 
discrimination trial. There were more 
errors on Reward-Positive problems than 
on Nonreward-Negative problems, as 
expected if the subjects tend to ap- 
proach new stimuli. 

Experimentally sophisticated subjects 
in a study by Brush et al. (1961, Ex- 
periment II) avoided the pretraining ob- 
ject on the first discrimination trial of 
Nonreward-Negative and Nonreward- 
Positive problems 95% of the time, but 
on Reward-Positive and Reward-Nega- 
tive problems they chose the pretraining 
object only 55% of the time. This find- 
ing would be expected if there was an 
initial tendency to approach new stimuli. 
Such a response tendency would com- 
pete with the tendency to approach the 
previously baited stimulus, but not with 
the tendency to avoid the previously 
nonbaited stimulus. 

The fewest errors should be on the 
Nonreward-Negative condition, since 
both of the initial response tendencies 
are correct; the most errors should be 
on the Nonreward-Positive condition, 
since both initial response tendencies are 
incorrect; and there should be interme- 
diate numbers of errors on the other 
two conditions, on each of which one re- 
sponse tendency is correct and one 1s 
incorrect. These postdictions were con- 
firmed, Furthermore, there were fewer 


errors on the Reward-Positive condition 
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than on the Reward-Negative condition, 
indicating that the initial tendency to 
approach the new stimuli was relatively 
weak. When the tendency to approach 
the pretraining stimulus is stronger than 
the tendency to approach the new stimu- 
lus, there is less effective competition if 
the pretraining stimulus is correct than 
if it is incorrect. 

Mishkin, Prockop, and Rosvold 
(1962) used the same subjects as Brush 
et al. (1961). Reward-Positive and Re- 
ward-Negative problems were given first, 
then Nonreward-Negative and Nonre- 
ward-Positive problems. Because of pos- 
sible facilitating transfer from the first 
to the second series, this study does not 
allow a comparison of the first two con- 
ditions with the second two conditions. 
As in Brush et al., there were more er- 
rors on the Nonreward-Positive condi- 
tion than on the Nonreward-Negative 
condition; but contrary to the earlier 
study, there were fewer errors on the 
Reward-Negative condition than on the 
Reward-Positive condition. The latter 
result would be expected if the tendency 
to approach the pretraining stimulus was 
weaker than the tendency to approach 
the new stimulus, exactly opposite to 
the conclusion suggested by the data of 
Brush et al. Mishkin et al., however, 
gave only one pretraining trial, and 
Brush et al. gave five. The tendency to 
approach the pretraining stimulus would 
be weaker in Mishkin et al., than 1m 
Brush et al., and could be weaker than 
the tendency to approach the new stimu- 
lus. A 
Performance on the Nonreward-Post- 
tive and Reward-Positive conditions in 
Mishkin et al. (1962) was better ven 
chance, contrary to implications of the 

f s In all of the 
analysis outlined here. 
i Sinha (1958), Worse 
other studies except med 
e was obtaine! 
than chance performance tosis, “ab 
when predicted by the eae stimu- 
Sinha suggested that confoun' 
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erences were present in his study. 
aby Mishkin et al, tm Table 2,006 
between-subjects design permitting 
groups to acquire different learning sets. 
The tendency to approach new stimuli 
would enhance the effects of the learn- 
ing set in the Nonreward-N egative group 
and the reversal learning set in the Re- 
ward-Negative group, but interfere with 
the effects of the reversal learning set of 
the Nonreward-Positive group and the 
learning set of the Reward-Positive 
group. 

Harlow and Hicks (1957), Leary 
(1956), and Moss and Harlow (1947), 
all using experimentally sophisticated 
subjects, gave Reward-Positive and Non- 
reward-Negative problems, each with 
one or two pretraining trials. In all 
three studies performance on the Re- 
ward-Positive condition was inferior to 
performance on the Nonreward-N egative 
condition, as would be expected on the 
basis of the reasoning above if there was 
an initial tendency to approach new 
stimuli. (Moss & Harlow, 1947, also 
found that performance after one or two 
standard discrimination trials on which 
both stimuli were Presented, with only 
the Positive stimulus baited, was inter- 
mediate between the other two condi- 
tions, in agreement with this interpreta- 
tion.) 

Blomquist and Harlow (1962) ob- 
tained the same results in experimentally 
naive subjects, as indicated previously, 
but also found that the performance 
curves converged. The convergence in- 
dicates that the tendency to respond to 
new stimuli weakens as a result of pro- 

longed training. (The weakening devel- 
ops slowly; Blomquist and Harlow gave 
360 problems on each condition; Harlow 
and Hicks gave only 90 on each condi- 
tion; Moss and Harlow, 15; and Leary, 
10.) On the discrimination trials of the 
Reward-Positive problems, the competi- 
tion between the tendencies to approach 
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the positive stimulus and the new stimu- 
lus decreases as the latter tendency 
weakens, and performance improves. 
There is no competition on the Nonre- 
ward-Negative problems. 

One further study shows that under 
certain conditions, within-problem gen- 
eralization is not required for the de- 
velopment of a tendency to avoid novel 
stimuli. King and Harlow (1962) gave 
experimentally sophisticated subjects 
720 problems, each with one pretraining 
and one discrimination trial. For one 
group 25% of the problems were Re- 
ward-Positive, for a second group 50% 
were Reward-Positive, and for a third 
group 75% were Reward-Positive. The 
other problems were Nonreward-Nega- 
tive. 

On the Reward-Positive problems, 
Group 75 was superior to Group 50, and 
Group 50 superior to Group 25. On the 
last blocks of Nonreward-Negative prob- 
lems, Group 75 was inferior to Group 
50, and Group 50 inferior to Group 25. 
In Group 75, performance on the Re- 
ward-Positive problems was superior to 
performance on the Nonreward-Negative 
Problems; the difference was reversed in 
the other two groups and was greater in 
Group 25 than in Group 50. 

The data suggest that a strong tend- 
ency to approach novel stimuli was de- 
veloped in Group 25, in which the ap- 
Proach to novel stimuli was reinforced 
757 of the time. Group 50 developed 
a weak tendency to approach novel stim- 
uli (weaker than the tendency to ap- 
proach the positive stimulus, since per- 
formance was better than chance on the 
Reward-Positive problems). Group 75 
developed a weak tendency to avoid 
novel stimuli (weaker than the tendency 
to avoid the negative stimulus, since 
performance on the Nonreward-Nega- 
tive problems was better than chance). 
The approach response to novel stimuli 
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was nonreinforced 759 of the time in 
Group 75. 

In summary, subjects have a fairly 
weak tendency to approach novel stim- 
uli (except when the percentage rein- 
forcement of this tendency is large, as 
in King and Harlow’s 257 group); but 
if within-problem generalization occurs 
from the positive to the negative stimu- 
lus, a generalized tendency to approach 
the negative stimulus becomes stronger 
than the tendency to approach novel 
stimuli, and subjects begin to approach 
familiar stimuli. Under conditions which 
minimize within-problem generalization, 
the tendency to approach novel stimuli 
is gradually extinguished by prolonged 
training in which the tendency is 509 
reinforced. 

Transfer of Learning Sets. Learning 
sets transfer and may interfere with the 
acquisition of a different learning set. 
Shifting the subjects from one kind of 
problem to another brings about a large 
number of errors, but as the shifts con- 
tinue, progressively fewer errors occur 
(Harlow, 1959). Warren and Sinha 
(1959) gave monkeys three series of 
discrimination learning-set problems. 


Between the first and second series, the 


monkeys served in studies of discrimi- 


nation, discrimination reversal, and con- 


ditional discrimination. Between the 


second and third series, they were given 
i Perform- 


double-alternation training. 
ance improved from the first to the sec- 
ond series, but deteriorated from the 
second to the third. : 

Behar (1961a) found that experi- 


mentally sophisticated monkeys trans- 


ferred an object-discrimination learning 
th the formation 


set which interfered wi 

of an object-alternation learning set, an! 
Levine and Harlow (1959) found that 
training on object-discrimination prob- 
lems interfered with later oddity learn- 


ing. ba 
Harlow (1949) gave position-discrim- 
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nation and object-discrimination prob- 
lems to monkeys that had previously ac- 
quired an object-discrimination learning 
set and a discrimination-reversal learn- 
ing set. By the end of training the mon- 
keys could solve problems of both kinds 
fairly rapidly, but not as rapidly as when 
they had been given problems of only 
one kind. The position discriminations 
were solved less rapidly than the object 
discriminations. There was, then, inter- 
ference between the learning sets, and 
the object-discrimination learning set in- 
terfered more on the position problems 
than the position learning set interfered 
with the object discriminations. Warren 
(1959) obtained essentially the same re- 
sults using monkeys that had acquired 
an object-discrimination learning set in 

a previous study (Sinha, 1958). 
Riopelle (1953) gave five problems 
with new stimuli each day, and one daily 
reversal problem, for 63 days. In the 
last block of 14 days, the percentage 
correct responses on Trial 2 of the new 
problems were greater than the percent- 
age correct responses on Trial 2 of the 
reversal problems, indicating that the 
discrimination learning set continued to 
interfere with the reversal learning set. 
One other study (Riopelle & Chinn, 
1961) relevant to this conclusion showed 
that monkeys can learn a position habit 
and a learning set simultaneously, but 
there is interference between these re- 
sponse tendencies. On the first trial of 
every problem the correct stimulus was 
on the same side. Over the last block of 
50 problems the subjects gave 85% cor- 
rect responses on the first trial, but a 
slightly smaller percentage correct re- 
sponses On the second trial; on half of 
als the correct stimulus had 


the second tri 
changed position. In the last 200 prob- 
lems there were 12% incorrect responses 


on Trials 5 and 6, and 85% of these er- 
rors were made on the side of the posi- 


tion habit. 
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Systematic Patterns of Responses. Sys- 
tematic patterns of responses occur, not 
only after a problem has been mastered, 
when all responses are correct, but also 
in the “presolution” period. The system- 
atic response tendencies occurring in the 
presolution period have been called 
hypotheses (Krechevsky, 1932; Levine, 
1959), strategies (Bowman, 1961), and 
error factors (Harlow, 1949, 1950, 
1959). The studies summarized below 
used Harlow’s error factor model (see 
Levine, 1959, for applications of the “hy- 
pothesis” model), 

1. Position Preference: None of Har- 
low’s (1950) 11 subjects had a position 
preference in the first hundred problems, 
although 3 had significant position pref- 
erences in the second hundred problems, 
and Harlow ( 1959) has concluded that 
Position preference is an “essentially un- 
important” error factor in rhesus mon- 
keys, 

2. Position Alternation: Spontaneous 
Position alternation apparently does not 
occur in monkeys, or at most seldom 
occurs (see Behar, 1961b; Warren & 
Sinha, 1959), 

3. Stimulus Perseveration (Stimulus 
Preference) : Stimulus perseveration de- 
creases as training Progresses (Harlow, 
1950). As training Progresses, the num- 
ber of Stimulus-perseveration error se- 
quences decreases, and the duration of 
the error sequences decreases (Harlow, 
1959). Stimulus perseveration also ap- 
pears to be stronger in young monkeys 
than in mature animals (Harlow, 1959), 

Brush et al. (1961) and Harlow 
(1959) have listed stimulus character- 

istics that apparently are associated with 
spontaneous approach and avoidance 
tendencies, and have suggested that gen- 
eralization from previously encountered 
positive and negative stimuli also affects 
stimulus perseveration. 

4, Stimulus Alternation: Stimulus al- 
ternation apparently has a low proba- 
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bility of occurrence in monkeys, but may 
be related to the Response-Shift Errors, 
discussed below, Behar (1961a) studied 
stimulus alternation, but in monkeys 
that had previously acquired a discrimi- 
nation learning set. 

5. Differential-Cue Error: The differ- 
ential-cue error is the frequency of errors 
on the first trial on which the correct 
stimulus object changes position (differ- 
ential-cue trial), relative to errors on 
trials on which the correct stimulus re- 
mains in the same position (multiple- 
cue trials). The differential-cue error 
persists longer than stimulus persevera- 
tion and is a function of the number of 
multiple-cue trials preceding the first 
differential-cue trial (Harlow, 1959). 
Harlow (1950) found almost as great a 
differential-cue error in the second hun- 
dred six-trial problems as in the first 
hundred, 

The first-trial outcome does not seem 
to affect the differential-cue error; Har- 
low and Warren (1952) found about the 
Same strength of differential-cue error 
following a first-trial error as following 
a first-trial correct response. 

Finally, it has been shown that elec- 
troshock convulsions increase the 
strength of the differential-cue error. 
After a learning set had been acquired, 
the subjects in a study by Braun, Pat- 
ton, and Barnes (1952) were given 20 
electroshock Convulsions, followed by 
further learning-set problems. On the 
Postshock problems, the subjects tended 
to perseverate the response to the posi- 
tion last rewarded (differential-cue 
error). 

6. Response-Shift Errors: At least 
two kinds of Tesponse-shift error have 
been described, According to one defini- 
tion, response shift is a greater frequency 
of errors following an initial correct re- 
sponse than following an initial error. 
A second definition describes response 
shift as the number of errors following 
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a series of correct responses when the 
initial response was correct, relative to 
the number of such errors when the ini- 
tial response was incorrect. It has been 
assumed that the response-shift error re- 
sults from “a strong tendency of the 
monkey to respond to both stimuli in the 
object-discrimination learning situation 
[Harlow, 1959, p. 516].” 

Harlow (1959) has concluded that re- 
sponse shift is the most persistent error 
factor, Harlow (1950) found that the 
percentage response-shift errors in- 
creased from 29% in the first hundred 
6-trial problems to 57% in the second 
hundred. 

Although Harlow suggested in 1950 
that Hull’s (1943) construct “reactive 
inhibition” might account for response- 
shift errors, he had abandoned that pos- 
sible explanation by 1959, since “Davis 
(1956) found little evidence that work 
inhibition is either an important or con- 
sistent variable influencing performance 
of rhesus monkeys [p. 518].” The rele- 
vance of Davis’ results is debatable, 
since his results could be attributed to 
competition between the inhibitory effect 
of work (reactive inhibition) and a fa- 
cilitating effect of work on “attention” 
enting-response mecha- 
nism (increasing work sometimes in- 
creased the performance level). Harlow 
(1959) has suggested that response shift 
may be related to spontaneous alterna- 

d in rats, but spon- 


tion, such as is foun À 
taneous position alternation and stimulus 


alternation seem to be infrequent in 
monkeys. 

Early in training 
rect responses following a co! 
sponse on Trial 1 than following an error 
on Trial 1; but later the trend is reversed 
—there are more correct responses fol- 
lowing an error than following a correct 
first-trial response. Riopelle (1953) 
found the former trend in the first 105 
of 315 problems, and the latter in the last 


or some other ori 


there are more Cor- 
ect re- 


105 problems. Davis (1961) obtained 
only the former trend in his subjects, 
but they were given only 140 problems. 

Harlow and Warren (1952) gave 450 
problems with pattern stimuli, then 144 
problems with stereometric stimuli, and 
found the former trend in both series of 
problems. McConnell and Schuck’s 
(1962) subjects also exhibited the early 
trend throughout training on 832 prob- 
lems with pattern stimuli, but there was 
only minimal within-problem learning 
following an error, indicating the pres- 
ence of strong stimulus preferences. 

Harlow and Hicks (1957), using rela- 
tively naive subjects, designated the ob- 
ject chosen on the first trial as correct 
or incorrect on different problems by 
baiting both stimuli or neither stimulus 
on the first trial. (They also included 
two other conditions, discussed previ- 
ously.) There were more errors on the 
nonbaited problems than on the baited 
problems in the first block of problems 
(15 problems per condition), and about 
equal numbers of errors on the two con- 
ditions over the five remaining blocks, 
in line with the conclusion. Brush et al. 
(1961) also used relatively naive sub- 
jects and the same kind of control of 
first-trial reward. Over the first block 
of problems (20 problems per condition) 
there were more errors on the nonbaited 
than on the baited problems, and there- 
after there was little difference. " 

Behar (1961b) found both trends in 
relatively naive monkeys. The curve de- 
picting percentage correct responses after 
an initial error crossed the curve for 
percentage correct after an initial cor- 
rect response between the fourth and 
fifth blocks of 50 problems. Another 
study by Behar (1962) suggests that ane 
effect may eventually disappear. ve 
experimentally sophisticated ae 
(previously trained by Behar, < ae 
he found 91.6% correct fe we 
Trial 2 following a correct Tria 


d 


sponse, and 93.5% correct following = 
initial error. Warren (1959) also use 
experimentally sophisticated subjects, 
and found that performance was not sig- 
nificantly different following first-trial 
errors and first-trial correct responses. 
The subjects were given only 16 prob- 
lems, however, and were trained to crite- 
rion on each one. 


THEORETICAL ANALYSIS 


The term “learning set” has usually 
been used to refer to one-trial learning or 
a close approach to one-trial learning 
occurring after training on problems that 
have a common basis of solution (such 
as “approach odd stimulus” in oddity 
problems). The theoretical analysis of 
one-trial learning outlined below is based 
on Hull-Spence theory. 


Basic Assumptions 


One-trial learning can occur if (a) an 
increment in habit strength produced by 

a single teinforcement or an increment 

in inhibitory Strength produced by a 

single nonreinforcement makes the excit- 

atory potential of the Positive stimulus 
greater than that of the negative stimu- 
lus, and (b) the difference between the 
excitatory potentials of the positive and 
negative stimuli is greater than oscilla- 
tory inhibition. Spence (1956, pp. 100- 
101) has assumed that the range of oscil- 
latory inhibition is reduced as a result of 
prolonged practice in the instrumental 
conditioning situation, and it seems rea- 
sonable to assume that the reduction 
would also occur in the learning-set sitų- 
ation. To satisfy Condition b the re- 
duced oscillatory inhibition would have 
to be smaller than single increments jn 
habit and inhibition. 

Condition @ requires that the Positive 
and negative stimuli have about equal 
habit strengths and about equal inhibi- 
tory strengths before the first response 
on a problem. This would occur, theo- 
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retically, if the subject experienced a 
large number of stimulus elements on 
many dimensions (as when multidimen- 
sional and “junk” stimuli are used), and 
responses to these elements were about 
equally often rewarded and nonrewarded 
over a long series of problems. As a re- 
sult of generalization, unused stimulus 
elements on these dimensions would also 
acquire habit and inhibitory strengths 
of magnitudes about equal to those of 
stimulus elements that were directly ex- 
perienced. Generalization between mul- 
tidimensional stimuli could be attributed 
to a failure of discrimination between 
the stimuli, resulting from similarity of 
common stimulus elements, or to “pri- 
mary stimulus generalization” among the 
similar stimulus elements. It would be 
less reasonable to attribute the generali- 
zation to identical stimulus elements, 

It might be argued that the number of 
problems actually required to attain the 
learning set is too small for the equaliza- 
tion of habit strengths and of inhibitory 
Strengths, since not enough stimulus ele- 
ments could be experienced, but Rio- 
pelle and Rogers (1956) found more 
generalization in college students given 
several conditioned stimuli than in sub- 
jects given a constant conditioned stimu- 
lus. White and Spiker (1960) obtained 
the same result in Preschool children, and 
White (1961) obtained data in line with 
this finding in a discrimination learning 
task, again using preschool children. In 
a relatively short series of training prob- 
lems, the subject may receive practice 
with several members of each stim- 
ulus dimension, and therefore generalize 
widely along each dimension. (However, 
Zimmerman, 1962, found that neither 
multiple-problem training nor overlearn- 
ing increased generalization in infant 
monkeys.) 

Analyses of Specific Findings 

Learning-Set Acquisition Curves. Ini- 
tially, the curves should be Positively ac- 


S 
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celerated, since associations are learned 
to a relatively large number of new 
stimulus elements in each of the early 
problems. Later in the course of learn- 
ing-set formation, as relatively fewer new 
stimulus elements are added on each 
new problem, the acquisition curve 
should be negatively accelerated. In gen- 
eral, the acquisition curve seems to be 
ogival (Harlow, 1959). 

The theory requires that the subject 
be given practice on a large number of 
problems, and as indicated above, the 
data seem to satisfy this requirement. 
Generally, well over 100 problems must 
be given before there is evidence that a 
learning set has been acquired. 

Experimental Variables. The relation 
between within-problem learning and 
problems to criterion is obtained only if 
the maximum number of trials on each 
problem is some number greater than 
12 and less than 50. Evidence from 
studies of the solution of a single two- 
stimulus problem by monkeys suggests 
that this maximum number of trials may 
be the point in training at which over- 
learning begins. The theory outlined 
here requires that overlearning interfere 
with the formation of a learning set, 
because overlearning leads to stimulus 
perferences which would have to be over- 
come by the development of equally 
strong preferences for other stimuli be- 
fore the learning set would be evidenced. 
The habit strengths of all stimulus ele- 
ments (and their inhibitory strengths) 
must be approximately equal for one- 
trial learning to occur. Overlearning 
would raise the magnitude at which the 
habits would be equal, and therefore in- 
crease the number of training problems 
required to equalize the habits. 

The analysis also requires, however, 
that there be some lower limit to the 
magnitude at which the habits are 
equalized. Otherwise, fewer problems 
would be required for the development 
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of one-trial learning, the fewer the trials 
per problem (since weaker stimulus pref- 
erences would develop). The assump- 
tion of a lower limit is reasonable, since 
it is known that “spontaneous” prefer- 
ences occur; at least the majority of 
these stimulus preferences must be over- 
come before one-trial learning can pre- 
dominate in the subject’s performance. 
(It may be noted that this reasoning 
requires the further assumption that 
most spontaneous preferences be 
“learned” but not “overlearned.”) 
Therefore, the theory predicts a decrease 
in the number of problems to criterion 
as the number of trials per problem in- 
creases to some number between 12 and 
50, and an increase in problems to 
criterion as the number of trials per 
problem increases beyond this limit. 

The slower learning-set formation with 
pattern stimuli than with stereometric 
stimuli reflects the greater difficulty of 
the pattern problems than the stereo- 
metric problems. The difficulty of the 
pattern problems may result from gen- 
eralization between the positive and 
negative stimuli of a problem. This 
explanation would require the assump- 
tion that training on a large number of 
problems either reduces the generaliza- 
tion or reduces the oscillatory inhibition 
to a value smaller than the difference be- 
tween the conditioned and the general- 
ized habit or inhibition (the latter alter- 
native seems more reasonable, since it 
is only an extension of the assumption 
regarding the reduction of oscillatory 1n- 
hibition with prolonged training, and 
since the former alternative appears, w 
be incompatible with the conclusion 
about within-problem generalization). 

An alternative explanation 1S also a 
line with Hull-Spence theory. Only $ 
stimuli can acquire By 
be “effective,” stimuli must 
at the moment of respond- 

> 1956). Even 
943; Spence, 


fective 
strength; to 
be observed 
ing (Hull, 1 
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naive monkeys probably have appropri- 
ate observing responses for stereometric 
objects, since their history includes 
everyday experience with such objects, 
but monkeys usually have less pre-ex- 
perimental experience with patterns. 
During prolonged training, however, ap- 
propriate observing responses are ac- 
quired and eventually a learning set 
develops. 

The orienting response mechanism 
also explains the effect of the spatial 
contiguity of the stimuli and response 
loci. The probability of observing the 
relevant stimuli in the noncontiguous 
condition is low, and therefore the proba- 
bility of observing the relevant stimulus 
differences is even lower (Stollnitz & 
Schrier, 1962). 

The increase in the rate of learning- 
set formation with increase in the size of 
pattern stimuli (percentage of back- 
ground cards covered by stimuli) can be 
explained by the same mechanism, as 
noted in the review section. 

Retention o 


greater early in learnin 
when generalization is 
avoid the complication 
novel stimuli in the tes 
reversal problems could be used. The 
studies of reversal learning in successive 
problems have uniformly confirmed the 
prediction, since Performance on the re- 
versal problems gradually improves until 
late in training one-trial reversals occur 
(e.g, Harlow, 1949; Riopelle, 1953). 
Furthermore, experimentally naive mon- 
keys probably acquire a reversal learn- 
ing set more slowly than monkeys that 
have previously acquired a discrimina- 
tion learning set (Harlow, 1949), The 
analysis outlined here also predicts that 
reversal should be faster when the first 
reversal problem experienced by the sub- 


g than later, 
extensive. To 
of responses to 
ts of retention, 


ject is given late in discrimination learn- 
ing-set training than when it is given 
early. 

Within-Problem Generalization. There 
is generalization from the positive to the 
negative stimulus within a problem 
in subjects that have had prolonged 
training. The direction of the generali- 
zation results from the greater frequency 
of correct responses in a problem in such 
subjects; very few errors occur after the 
first trial of a problem, The increase in 
habit in a problem is therefore greater 
than the increase in inhibition, and the 
habit generalized from the positive to 
the negative stimulus is apparently 
greater than the inhibition developed. 

Some of the elements of the negative 
stimulus may be similar to elements of 
the positive stimulus, providing a basis 
for generalization, but temporal con- 
tiguity is apparently also required. 

Reactions to Novel Stimuli. The tend- 
ency of experimentally naive monkeys to 
approach novel stimuli may be asso- 
ciated with a “curiosity drive” (Harlow, 
1953), or more accurately with the drive 
stimuli produced by such a motive, but 
the source of the tendency is not rele- 
vant here. After prolonged training, no 
stimuli are entirely new, since at least 
some of their elements have been previ- 
ously experienced, and therefore the 
tendency to approach new stimuli should 
eventually disappear. The response 
tendencies of sophisticated monkeys are 
discussed in the review section, 

Transfer of Learning Sets. A learn- 
ing set should transfer between problems 
if the contexts in which the stimuli are 
Presented are identical and if the stim- 
uli are of the same kind. Changing the 
stimulus context can change the response 
tendencies, and the response tendencies 
would be different if the stimuli used in 
one kind of problem had few elements 
in common with those of the other kind 
of problem. If transfer occurs, it may 
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result in interference or in facilitation, 
depending on the kinds of problem, In 
most cases, however, facilitation and in- 
terference probably result from the 
transfer of performance sets, especially 
the transfer of orienting responses, ra- 
ther than of habit and inhibition. It 
seems likely that one kind of orienting 
response is acquired on position-alterna- 
tion problems, for example, and another 
kind on  stimulus-discrimination prob- 
lems. 

Systematic Patterns of Responses. 
With the exception of the response-shift 
error, all of the error factors can be ex- 
plained on the basis of spontaneous or 
previously acquired response tendencies 
to specific stimulus elements. The dif- 
ferential-cue error, for example, may be 
related to position habits, and should 
persist longer than such an error factor 
as stimulus perseveration, since the re- 
sponse to position is rewarded on each 
multiple-cue trial. 

Regarding the response-shift error, it 
has been shown that early in training 
there are more correct responses follow- 
ing a correct response on Trial 1 than 
following a first-trial error, but that this 
trend is reversed later in training. Behar 
(1961b) has concluded that: 

The negative reinforcement effect of nonreward 
is absent in naive animals but is acquired in 
successful discrimination learning [p. 51]. 


Spence (1960) has assumed that in in- 
strumental reward conditioning, 

a non-reinforcement is 
f the response tendency 
els, such as 
the earliest 


the inhibitory effect of 
related to the strength o 
and would be zero at very low lev 
Presumably would be the case in 
stages of learning [p. 98]- 


Riopelle and Francisco’s (1955) data 
support this assumption (see also Rio- 
pelle, Francisco, & Ades, 1954, who used 
somewhat different conditions, but ob- 
tained similar results). One of their 


groups was trained on problems with 
heither stimulus baited on the first trial 
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(nonbaited group) and another with 
both stimuli baited on the first trial 
(baited group). The stimulus that the 
subject responded to on the first trial 
was baited on the remaining trials and 
the other stimulus was nonbaited. After 
250 training problems, the subjects were 
given 50 transfer problems, each with a 
predetermined positive stimulus baited 
on all trials. On the transfer problems, 
the nonbaited group repeated responses 
86% of the time following an initial non- 
reward and the baited group only 45%. 

Riopelle et al. (1962) gave three 
groups of experimentally naive subjects 
four daily problems with new stimuli, 
and six daily problems each with one new 
stimulus and one recurrent stimulus. 
The recurrent stimuli were always posi- 
tive in one group, always negative in 
another, and sometimes positive and 
other times negative in the third group. 
On the “new-stimulus” problems, the 
“recurrent-positive” group gave more 
correct responses following a correct 
first-trial response than following an 
initial error throughout training (except 
in the fourth block of 10 days). In the 
“recurrent-negative” group this trend 
was obtained over the first third of train- 
ing (20 days), but the opposite trend 
was obtained over the last 40 days. These 
results confirm the general conclusion of 
this section, on the assumption that the 
negative reinforcement effect of a non- 
reward was acquired sooner and was 
stronger in the recurrent-negative group 
than in the other two groups. 3 

The increment in inhibition resulting 
from a single nonreward increases as 

ini ; 5 increment 1n 
training continues; but the incremen i 
habit strength, resulting from @ single 
reinforcement, is a nega 
ated function of habit strength before 

d would be larger 
the reinforcement an : 
A l o than later. Early in 

early in training tin habit 
training, therefore, an increm® "in inbibl- 
is greater than an incremen 
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— a single reward has a greater 
ent i m o than a single non- 
reward. Later in training, the increment 
in habit is smaller than the increment in 
inhibition, and a nonreward has more 
influence on performance than a reward. 
One might speculate that if habit and 
inhibition ever became asymptotic in 
value, performance could not rise above 
the chance level, since all stimulus ele- 
ments would have the maximum possible 
strengths of habit and inhibition. (Be- 
cause of oscillatory inhibition, responses 
would occur, but no further learning 
could occur.) Furthermore, the perform- 
ance level should decline as the approach 
to the asymptote becomes close. Mc- 
Connell and Schuck (1962) gave 832 
six-trial problems with pattern stimuli, 
and found such a decline from the 
twelfth to the thirteenth block of 64 
problems, except in a slow-learning sub- 
group (see also Schuck et al., 1961). 
However, most of the existing data on 
monkeys with extensive learning-set ex- 
perience suggest that the Prediction is 
incorrect, and if further research fails to 
confirm the Prediction, the assumption 
that habit and inhi 
will have to be modified. 


emphasized 
ion are not 
ations from 


relative perma- 
nence” of habit and inhibition, 
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The independent variable of delay of reinforcement is traced from its 
early empirical history through the systematic treatment provided by 


Hull and Spence. 
recent experimental stu 
delay of punishment on 
parative studies wi 
of reinforcement to 


The relevance of delay of reinforce- 
ment to an increasing number of theoret- 
ical issues and the need to provide a 
comprehensive review of the historical 
context into which these recent studies 
can be viewed is one purpose of this 
paper. A second purpose is to review 
the current literature critically with the 
aim of clarifying the problems needing 
additional experimental work, and fi- 
nally, to indicate the relevance of delay 
of reinforcement to several aspects of 
more complex human behavior. 


HISTORICAL BACKGROUND 


On the Nature of Learning 


A comparative study by Hunter 
(1913) apparently was the first experi- 
mental study on delay of reinforcement, 
although the theoretical significance of 
delay of reinforcement as an independ- 
ent variable can be traced to Watson’s 
(1917) experiment on delayed reward of 
a digging response. Watson found that 
a 30-second delay had no effect on the 
acquisition of the response and con- 
cluded that his results presented a diffi- 
cult problem for the view that the get- 


2 This paper was prepared in conjunction 
with a research project supported by Unite 
States Public Health Service Research Grant 
No. M-6066 (A) from the National Institute 


of Mental Health. 


The empirical findings and theoretical implications of 
dies are noted. The effects of delay of reward and 
the learning process are considered, the com- 
th human Ss are reported, and the relevance of delay 
abnormal behavior and personality is summarized. 


ting of food stamps in the most recent 
act. 

Warden and Haas (1927), after an 
earlier study by Yarbrough (1921), 
followed with an extension of Watson’s 
(1917) study; subjects were run in an 
8-unit maze rather than a problem box 
and feeding in the goal box was permit- 
ted after 1- or 5-minute delay intervals 
rather than a 30-second interval. The 
delay did not increase errors, trials to 
first mastery or to a criterion of mastery, 
although time scores showed that de- 
lay subjects spent longer in the maze. 
Warden and Haas (1927) distinguished 
between learning and performance, con- 
cluding that delay has no effect on 
learning, 

[for] even if the slowing up in speed of loco- 
motion were clearly due to a difference in 
motivation arising from the interval of delay 
it would still be difficult to connect this in 
any important way with the process of fixa- 
tion [p. 114]. 

However, Hamilton (1929), also using 
a maze task, found that a 1-minute 
delay increased both trials and time 
scores; also, she found that a 15-second 
delay in food presentation reduced the 
number of crossings of 2 charged grid 
for food reward. The essential difference 
between the two studies was that War- 
yew and Haas (1927) detained the subs 
jects in the goal box, as had Watson 


(1917); put Hamilton (1929) delayed 
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the reward by using a separate delay 
chamber. Roberts (1930), using a prob- 
lem box, was able to show that when the 
delay interval clearly separated the re- 
sponse itself from the goal, both time 
and trials to solution were increased. 
Also, Roberts noted that all subjects 
who received delayed reward showed an 
increase in errors early in learning due 
to a tendency to go directly to the goal- 
box door and that this “recency” factor 
initially interfered with the association 
of the correct instrumental response with 
the reinforcement. Thus Roberts, in dis- 
cussing his results, identified two factors 
which later became important: “cues” 
as indirect or secondary reward and the 
disrupting effect of competing responses, 
although he did not realize the full 
implications of his comments at the time, 
Studies by DeCamp (1920) and Cle- 


imilar stud 
by Warden and Diamond (1931) on 
rats, 
By 1933, the evidence Supported these 
conclusions: 


(a) Delay of reward in- 
creased time, error, and trial scores, (b) 
In order for the detrimental effects of 
delay to occur, the indirect reward from 
goal-box cues had to be eliminated, (c) 
There was no agreement on the amount 
of delay necessary to interfere with 
learning, although responses which led 
to more immediate reward (in time or 
space) were preferred to responses which 
resulted in delay of reward. (d) Al- 

though the data seemed incompatible 
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with a simple frequency-recency postu- 
late, there was no theoretical model to 
explain the data and diverse temporal 
gradients that had been found. 


Theoretical Treatment 
Delay: 1932-60 


Hull’s Goal Gradient. Hull (1932) 
proposed his now famous goal-gradient 
hypothesis which stated that: 


the goal reaction gets conditioned the most 
strongly to the stimuli preceding it, and the 
other reactions of the behavior sequence get 
conditioned to their stimuli progressively 
weaker as they are more remote (in time or 
space) from the goal reaction [pp. 25-26]. 


of the Effect of 


This hypothesis provided an explanation‘; 


for a range of phenomena 
indirectly related to delay, such as the 
choice of a shorter path to a goal (e.g. 
DeCamp, 1920), the elimination of 
errors and their order of elimination, 
and the rate of locomotion in a maze. 
Anderson (1933), by Teanalyzing some 
previously reported data (Anderson, 
1932) which had confirmed the earlier 
study of Sams and Tolman (1925), pro- 
vided support for Hulls speed-of-loco- 
motion hypothesis, Using a time-dis- 
crimination Problem, Anderson (1933) 
found a direct relationship between 
length of delay of reinforcement and 
running time. An extensive study by 
Wolfe (1934), using both a T maze 
and a black-white discrimination prob- 
lem, again confirmed that delay of Te- 
ward produces extreme inefficiency in 
learning, and demonstrated how Hull’s 
(1932) goal-gradient hypothesis could 
be applied to explain his delay of rein- 
forcement gradient. 

However, the effectiveness of the goal 
gradient as an explanation for the 
effects of delay did not eliminate all 
the inconsistencies. For example, i? 
Wolfe’s (1934) study, a delay of 
minute seriously interfered with learn- 
ing, although Within 100 trials some 


directly and” 
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learning was possible with delays of 10 
or 20 minutes; however, Riesen (1940) 
reported a discrimination study in which 
chimpanzees failed to learn in 600 trials 
with only a 4-second delay, and with 
pretraining he was able to extend the 
delay gradient to only 8 seconds. 

The experimental work by Perin 
(1943a, 1943b) considerably clarified 
the factors behind the diverse durations 
of the temporal gradients that had been 
reported and added support to the 
view that the temporal gradient itself 
extended only over a short interval. 
Perin (1943a, 1943b) employed a Skin- 
ner box, modified so that a movement in 
the correct direction delivered a pellet 
of food after a delay interval; move- 
ment in the incorrect direction was not 
reinforced. In one study (1943b) the 
bar was removed after each correct re- 
sponse and in the second study (1943a), 
it was removed after either a correct 
or incorrect response. Perin’s (1943a) 
animals showed a negatively accelerated, 
descending gradient with no learning 
of the bar-pressing response, established 
first by pretraining (50% level of cor- 
rectness), with delays as long as 30 
seconds. 

These studies which followed Hull’s 
(1932) formulation of the goal-gradient 
hypothesis led to a revision in his Prin- 
ciples (1943): 


what was originally 
ciple has turned ou 


regarded as a single prin- 
t upon investigation to 
involve two fairly distinct principles: a) the 
short gradient reported by Perin, which will 
be called the gradient of reinforcement .- . 
and (2) the more extended gradient which is 
presumably generated as @ secondary phe- 
nomena from Perin’s gradient of reinforcement 
acting in conjunction with the principle of sec- 
ondary reinforcement. The second and more 
extended gradient may . . - retain the original 
name of the goal gradient [p. 143]. 


Thus, Hull (1943) retained the idea of 
a short temporal gradient of reinforce- 
ment, but treated the extended goal 


gradient, not as a primary principle, but 
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one derivable from the basic principles 
of the system. In his final treatment, 
Hull (1952) (a) derived the gradient of 
reinforcement from other postulates, 
whereas previously (Hull, 1943, p. 178) 
it had been treated as a postulate, and 
(b) distinguished between two types of 
delay of reinforcement (Hull, 1952, pp. 
126, 132). The first case refers to a 
chain of responses, as occurs when rein- 
forcement is delayed by differing runway 
lengths; the reinforcement was seen as 
decreasing in strength as it is general- 
ized backward through the chain of 
responses to the beginning of the se- 
quence, thus yielding the gradient of 
reinforcement within a given reaction 
chain (1952, p. 126, Corollary iii, A). 
The second case involves a single S-R 
connection for which reinforcement is 
delayed and without the subject doing 
anything in particular during the delay 
interval, the limit of learning being fixed 
by the duration of the stimulus trace 
through which the goal response can 
operate to reinforce the stimulus trace 
at the end of the interval and generalize 
to the beginning instrumental act (1952, 
p. 132, Corollary iii, B). Thus, Hull 
(1952) retained the notion of a short 
primary gradient, but derived it from 
other postulates of the system, and 
further, he identified two types of delay 
of reinforcement conditions; those in- 
volving a response chain and those in 
which there was only a single response. 

Spence’s Treatment of Delay. Perkins 
(1947), using a T maze, made an at- 
tempt to eliminate as many cues as 
possible by enclosing the maze, inter- 
changing the two delay boxes from one 
side to another resulting in each gn 
box being followed by reward ba 180 
the time, and rotating the maz 


Perkins 

t the start of each day. 
He d a limit of 120 seconds 
delay on the ats to ae 
ee protien The performance of the 
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experimental subjects who had the pe 
chambers interchanged was poorer A 
the performance of a control group for 
whom the delay chambers were not 
interchanged, that is, for the control 
subjects the same delay chamber was 
always followed by reward. Presumably, 
slight differences in the delay chambers 
were sufficient to provide distinctive cues 
for differential learning, via secondary 
reinforcement, of the choice task. It 
was the data of Perkins (1947), Riesen 
(1940), and Gulde (1941; cited in 
Grice, 1948) that led Spence (1947) to 
suggest that: 
it would not seem unreasonable to hypothesize 
that there is no primary gradient of reinforce- 
ment but that all learning involving delay of 
primary reward results from the action of 
immediate secondary reinforcement which de- 
velops in the situation .... Such a hypothe- 
sis eliminates the necessity of explaining how 
reward seemingly acts backward over time to 


influence something which occurred earlier 
Ip. 7]. 


Spence went on to Propose that although 
one might eliminate secondary reinforce- 
ment from the external environment (as 
attempted by Perin, 1943a, 1943b; Per- 
kins, 1947) a possible basis for differ- 
ential secondary reinforcement might be 
Proprioceptive cues existing within the 
animal as a result of the differential re- 
sponses. Thus, Spence Proposed that 
the concept of delay of reinforcement 
would be treated economically as a sec- 
ondary reinforcement phenomena. 
Grice (1948), in a well-controlled 
study, further clarified the role of sec- 
ondary reinforcement in delayed-reward 
learning by using a discrimination prob- 
lem in order to eliminate any distinc- 
tive turning response from always being 
associated with reward, as it is in a 
T maze (eg., Perkins, 1947; Wolfe, 
1934). In separate control groups he 
introduced either distinctive white or 
black goal boxes which matched the 
discrimination choice or obstacles which 
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forced the rat to make distinctive motor 
responses upon entering the delay cham- 
ber. With immediate reinforcement, sub- 
jects learned the discrimination in 20 
trials, but with a delay of only .5 sec- 
ond, nearly 5 times as many trials were 
needed, and with a delay of 5 seconds, 
the number of trials increased to 580, 
and within the limits of the experiment 
(1,440 trials) no learning was possible 
with a 10-second delay. With the aid 
of the cues provided by either the dis- 
tinctive goal boxes or the obstacles, 
there was significantly faster learning by 
these groups. Grice’s (1948) gradient 
was much steeper than those of Wolfe 
(1934), Perin (1943a, 1943b), or Per- 
kins (1947), and more comparable to 
the extremely short, ones found by 
Riesen (1940) and Gulde (1941), Grice 
(1948) felt he had clearly demonstrated 
the effect of secondary reinforcers and 
felt that it was immediate secondary 
reinforcement which accounts for de- 
layed reward learning; further, he sug- 
gested that the fact that any learning 
occurred at all under delay conditions 
may be due to internal cues such as 
Sensory traces of the black-white dis- 
crimination which persist, in a decreas- 
ing fashion, for perhaps as long as 5 sec- 
onds, and that it is this trace which is 
Contiguous with the food reward, The 
acceptability of Grice’s (1948) specula- 
tion about stimulus traces and secondary 
reinforcement is a matter ‘of theory 
which we will return to later, but clearly, 
the temporal gradient, at least in rats, 
appears to be very short, and distinctive 
Cues are largely responsible for the dura- 
tion of the gradient over time (i.e. its 
slope). 

Spence (1956) revised his earlier 
(1947) statements on the basis of ex- 
perimental studies occurring in his lab- 
oratories (Carlton, 1954 ; Harker, 1950; 
Logan, 1952; Ramond, 1954; Shilling, 
1951). Spence (1956) recognized, 4S 
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did Hull (1952), that there were two 
types of delay of reinforcement (i.e., 2 
response chain as in runways of varying 
length and a nonchaining case where 
the delay is introduced between the 
instrumental response and the reinforce- 
ment). Rather than utilizing the notion 
of a decreasing secondary-reinforcement 
effect which acts to limit the habit 
strength of the instrumental response, 
Spence suggested (a) that delay may 
operate instead to reduce the incentive 
motivation the longer the response chain 
over which the rg must generalize, or in 
the nonchaining case, that the delay 
weakens the strength of the classically 
conditioned consummatory response; 
(b) that delay, of the second or non- 
chaining type, is analogous to experi- 
mental extinction, with the decrements 
in performance under delay of reinforce- 
ment resulting from essentially the same 
inhibitory factors that operate in experi- 
mental extinction (Spence, 1956, PP- 
148-164). 


Regarding the latter suggestion, the 
studies (Carlton, 1954; Harker, 1950; 
Shilling, 1951) reported in Spence 
(1956) show that the performance de- 
crement under delay is contingent upon 
the opportunity for competing responses 
to occur; failure to maintain 4 goal 


orientation and the occurrence of con- 


flicting responses thus provide a contri- 
bution to an additive inhibitory variable 
that reduces performance. Ramond 
(1954) provided some support for the 
additive nature of the inhibitory effect 


of delay by finding n° interaction wit 
drive level, as would be predicted from 
Hull’s formulation; this finding has been 


replicated (Renner, 1963). 

Mowrer’s Revised Two-Factor The- 
ory, Mowrer (1960) recently has dis- 
carded the concept of habit by proposing 
that all learning is conditioning, and he 
has provided a model that is applicable 


to both reward and aversive learning 


BA 


under delay of reward or punishment. 
The literature on delay of punishment 
is sparse and will be treated in a separate 
section of this paper; however, the re- 
cent revision of two-factor theory by 
Mowrer (1960) may provide the com- 
mon theoretical framework necessary for 
integrating the delay of punishment with 
the delay of reward literature. The re- 
vised two-factor theory seems to have 
its greatest importance in accounting 
for the effects of cue utilization and sec- 
ondary reinforcement with respect to 
learning and extinction under delay of 
reinforcement. 

Mowrer (1960) proposes that the af- 
fective component of the reaction made 
to either decremental or incremental 
reinforcement becomes classically condi- 
tioned to either the response produced 
or external stimuli. He assumes that 
traces of such stimuli may persist for 
as long as 30-45 seconds which yield 
four categories of temporal gradients 
of reinforcement. After such condition- 
ing these stimuli become positive or 
negative secondary reinforcers which 
may “bridge” an interval between stim- 
ulation and reinforcement longer than 
30-45 seconds; the conditioned stimuli, 
or secondary reinforcers, presumably 
have gradients of lesser extent and level, 
but are capable of establishing new 
habits (i.e., second-order conditioning of 
hope or fear). Mowrer derives support 
for his position by considering much of 
the same literature as Spence (1947) 
and Hull (1952), but arrives at the dif- 
ferent conclusion that there is a temporal 


gradient equal to the time interval over 
which trace conditioning is possible. 


Mowrer regards the conditioned second- 
ary reinforcers as the habit, rather than 
a derived reward which in turn acts as 


an immediate reinforcement for an S-R 


forcement €: i 
from his notions, especially 
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F ect to the role of both positive 
here ans A secondary reinforcement in 
both delay-of-reward and delay-of-pun- 
ishment learning (Mowrer, 1960, pp. 
tel Issues. The history and 
the theoretical hypotheses on delay of 
reinforcement provide a context in which 
to classify and evaluate the recent ex- 
periments (primarily post-1956). Some 
of the questions to be considered are: 
What is the evidence for or against the 
notions of a primary gradient of rein- 
forcement versus a derived gradient? 
What is the role of secondary reinforce- 
ment and cue utilization in delayed re- 
ward learning? How does delay produce 
its detrimental effect? If nonreward (or 
extinction) is the limiting case of delay 
of reinforcement, then, the effects of 
delay as a frustration, the partial rein- 
forcement effect, and the role of compet- 
ing responses should have a direct rela- 
tionship to delay of reward learning. 
What is the similarity between delay of 
punishment or escape and delay of posi- 
ent with respect to the 
mporal gradients? What 
: of comparisons across 
species, especially when extended to the 
human level? And finally, are there 
some implications of these issues for 
more complex response patterns, such 
as conflict behavior where rewards and 
punishments are both administered, but 
not simultaneously? These are the more 
important issues in terms of which the 
recent literature can be examined. 


nature of the te 
are the results 


RECENT EMPIRICAL AND THEORETICAL 
FINDINGS 


On the Nature of the Temporal Gradient 


Primary or Derived. The question of 
the existence of a “true” temporal gradi- 
ent of reinforcement (e.g., Hull, 1952) 
versus a derived gradient (Spence, 1947, 
1956) has continued to interest several 
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investigators. A unique experiment by 
Mathers (1953) attempted to impose 
even greater control on secondary rein- 
forcement than Grice (1948) by elimi- 
nating any distinctive aspect of the stim- 
ulus trace of the discrimination task. 
Recall that Grice (1948) had speculated 
that it was this stimulus trace that per- 
mitted any learning at all. Mathers 
(1953) used a Y maze and ran his ani- 
mals, on any given day, under either 
food or water drive, but never under 
both drives. On a water-deprived day, 
a white choice was followed by water, 
and on food-deprived days, black was 
rewarded by food, but on any given day 
only one reward was present; different 
groups experienced different delays of 
reinforcement, Thus, each stimulus trace 
was followed by reward equally often 
and should have acquired equal rein- 
forcing power, therefore eliminating any 
differential effect occurring because one 
of the stimulus traces was always fol- 
lowed by reward. His rats learned to 
go to the correct side on any given day 
with delays of up to 20 seconds, although 
not with a high level of accuracy, and, 
as expected, subjects with a shorter de- 
lay performed consistently better than ° 
subjects with longer delay intervals. 
Mathers (1953) concluded that his data 
Supported the concept of a temporal 
gradient. 

Mowrer (1960, p. 360), by consider- 
ing delayed-reward learning as trace 
conditioning, argues for a primary tem- 
Poral gradient of 30-45 seconds dura- 
tion; he considers the shorter empirical 
gradients (e.g., Grice, 1948; Riesen, 
1940) an atypical function of the com- 
plex discrimination task used, which he 
regards as insensitive for studying delay 
of reinforcement since a sign rather than 
a specific response must be learned. 
Thus, for Mowrer, the gradient repre- 
sents the decreasing effectiveness of a 
reinforcement as it is more remote i? 
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time from the response-produced stimuli 
associated with a specific response. But, 
the justification is not clear for Mowrer’s 
implicit distinction that the stimulus 
trace of external stimuli (as in discrimi- 
nation learning) should be effective over 
a shorter duration than the trace of 
distinctive internal response-produced 
stimuli (as in position learning) for the 
conditioning of hope or fear, especially 
since in a discrimination problem the 
negative stimulus also takes on avoid- 
ance properties (Grice & Goldman, 
1955) which may aid learning. 

Two other investigators, one using re- 
ward with pigeons (Myers, 1958) and 
the other using shock and a conditioned 
fear response with rats (Davitz, Mason, 
Mowrer, & Viek, 1957) likewise inter- 
preted their results as supporting a pri- 
mary temporal gradient and refuting 
Spence’s (1947) earlier suggestion that 
all learning under delay is due to sec- 
ondary reinforcement. Myers (1958) 
concluded that although differential cues 
facilitate learning, they are not essential. 
Also, Lawrence and Hommel (1961) 
recently have questioned the stimulus- 
trace theory as put forth by Grice 
(1948). a ; 

The use of electrical brain stimulation 
would seem to offer a useful technique 
for studying the nature of the temporal 
gradient; but, apparently this method 
has not yet been explored. Apart from 
this “absolute” problem, the variables 
which affect the characteristics of the 
empirical temporal gradient (e.g, mod- 
ify its slope or duration) have been 
investigated in their own right and have 
Provided some interesting and theoreti- 
cally useful observations. 

Secondary Reinforcement , 
(eg., Brackbill & Kappy, 1962; Grice, 
1948; Renner, 1963; Spence, 1947), it 
has been assumed that cues present m 
the learning situation acquire secondary 
reinforcing power which may greatly 


. Frequently 
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extend the duration of the temporal 
gradient over time. The argument is 
that any stimulus event which is associ- 
ated with primary reinforcement be- 
comes conditioned to it and thus 
provides immediate secondary reinforce- 
ment for the correct instrument response. 
Grice and Goldman (1955) demon- 
strated that in a discrimination problem 
the avoidance of cues associated with an 
incorrect response also operates to ac- 
count for choice learning under delay 
of reward; thus, where positive and 
negative cues are present, both second- 
ary reward and avoidance appear to be 
operating. Also there is indirect evi- 
dence that distinctive cues act as sec- 
ondary reinforcers as Spence (1947) 
suggested; for example, Kelleher (1957), 
in an experiment on token learning, 
found that performance was inversely 
related to the delay in exchanging the 
chips for a primary reward, a delay of 
1 hour leading to extinction. However, 
only two investigators (Jenkins, 1950; 
Peterson, 1956) independently demon- 
strated that such cues are capable of 
establishing a new response, and only 
Jenkins assessed the relative reinforcing 
power of such cues by obtaining a de- 
rived secondary gradient, that is, the 
longer the delay between the cues and 
the reinforcement the less effective was 
the cue for establishing a new response. 
Thus, at least in some conditions, cues 
become reinforcers, but the conditions 
under which cues are selected and uti- 
lized is not clear. 
Cue Utilization. 
Lutz & Perkins, ! 


have demonstrated t di 
cues are present, they facilitate the 


performance of subjects when reward is 
delayed, although these studies do not 
provide independent evidence that such 
cues actually would be capable of rein- 
forcing a new response- Lutz and P 
kins found that the length of delay made 


Several studies (e.g., 
960; Renner, 1963) 
hat when distinctive 
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no difference, whereas ee 
that the effectiveness o 
ti erformance may be related to the 
et ol the delay interval. These ui 
ings suggest that delay may be relate 
meaningfully to the breadth of learning 
hypothesis (see Kausler & Trapp, 1960) 
as one of the factors leading to less 
focused behavior. In addition, the avail- 
ability of positive cues may contribute 
to the resistance to extinction that re- 
sponses learned under delay-of-reward 
exhibit (to be discussed later). Thus, 
the utilization of cues, which may then 
become secondary reinforcers, seems to 
be dependent on the delay more than 
on mere contiguity with reward (Lutz & 
Perkins, 1960; Peterson, 1956: Renner, 
1963) just as partial reinforcement 
Notterman, 1951) or discrimination 
training (Wike & McNamara, 1957b) 
may facilitate the reinforcing effect of 
a distinctive stimulus more than the 
mere contiguity with reward, However, 
further work js needed to clarify the 
role of cue utilization in delayed-reward 
learning and the relationship of cues to 
the breadth of learning hypothesis and 
secondary reinforcement, 

In a related study (Holder, Marx, 
Holder, & Collier, 1957), the effects of 
delay (interpreted as frustration) was 
conditioned to a conditioned stimulus 
( CS); which in turn, Prompted faster 
running in a second Tunway because of 
the increased motivation resulting from 
the conditioning of the delay-produced 
frustration drive to the Cs, This study 
demonstrated that delay can also result 
in an external stimulus taking on the 
power of a negative secondary reinfor- 
cer. The second implication of this 

study—that is, the motivational aspects 
of lack of an expected reinforcer—wil] 
be discussed later; what remains to be 
clarified is whether cues brought into 
operation by or during the delay are 
facilitating performance by acting as 
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Positive secondary reinforcers which 
span the time interval, or if such cues 
take on negative secondary reinforcing 
power and facilitate postdelay perform- 
ance through an increase in the motiva- 
tion. The details of cue utilization under 
delay of reinforcement remains to be 
specified, but delay versus goal-associ- 
ated cues may bea necessary distinction. 

Interactions. The runway studies of 
Ramond (1954) and Renner (1963) 
have demonstrated that drive level and 
delay combine additively, with an in- 
crease in drive level elevating the gra- 
dient. Apparently, the Drive Level X 
Delay interaction has not been studied 
in a choice situation. 

Logan (1960, p. 58) reports that 
delay and amount of reward combine 
in additive fashion to determine response 
speed; again, data are available only for 
runway performance and not for choice 
behavior. However, as Logan (1960) 
cautions, 


[the] units-of-measurement question is of spe- 
cial significance in interpreting interactions . . » 
because on only the assumption that response 
speed is a linear estimate of response tendency 
can the obtained additive interactions in per- 
formance be taken as indicating additive inter- 
actions among the relevant variables [p. 60]. 

It is clear that the empirical temporal 
gradient of reinforcement is a negatively 
accelerated decreasing function, but the 
exact nature of the gradient is still in 
question, and how it is modified by 
other variables in a variety of problems 
is yet to be specified. More work will 
be required in order to establish the 
manner by which delay of reinforcement 
combines with other variables to control 
learning and performance. 

Other Observations. Ferster (1953) 
and Dews (1960) found that a pecking 
response could be maintained with a de- 
lay that ordinarily would have produced 
extinction (e.g., 60 seconds), by first es- 
tablishing the response with short delays 
and gradually increasing the delay inter- 
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vals. Consistent with this is the report 
by Lawrence and Hommel (1961) that 
delays in a Grice box can be extended 
to 60 seconds without any decrease in 
accuracy after a discrimination had been 
learned. Thus, the experience of the 
organism with a specific response is a 
relevant variable in the ability to func- 
tion with delayed reinforcement, at least 
with respect to maintaining an estab- 
lished response (Dews, 1960), although 
there is apparently no evidence on the 
possible existence of a learning how to 
Jearn-under-delay of reinforcement phe- 
nomena. It seems reasonable to expect 
that experience with delay of reinforce- 
ment may be an important factor in 
the general ability of an organism to 
function under delay of reinforcement. 

Levin and Forgays (1960) have dem- 
onstrated that sensory change can act as 
a reinforcer in a maze, and that delay in 
the sensory change leads to a delay-of- 
reinforcement gradient similar to those 
resulting from the use of traditional pri- 
mary reinforcers. 


One of the most interesting and sys- 
f delay of reinforce- 


(1960; Logan, 
55) who has studied 


forcement. 
ment is varied be 
likely values (eg. 1 and 9 seconds) that 


it will lead to faster performance than at 
a constant delay at the mean of the two 
values (e.g. 5 seconds) and approaches 
the performance of a group with a 
constant delay at the shorter of the two 
values (e.g., 1 second) (pP- 70-71, 73- 
74), Peterson (1956) failed to confirm 
this last aspect of variable delay as have 
Wike and Kintsch (1959) with partial 
delay; however, Pubols (1962) has con- 
firmed the former conclusion for choice 
behavior, that is, subjects will show a 
preference for a varied reward over a 
constant delay at the mean of the two 


values; this finding follows from the 
assumption of a negatively accelerated 
decreasing gradient, which Logan (1960, 
P- 90) derives from the assumption that 
incentive is related to the r, which be- 
comes conditioned to the traces of cues 
(stimuli) present when the instrumental 
response was completed. 

With respect to correlated delay of 
reinforcement (Bowrer, 1961; Logan, 
1960), a subject can be trained to run 
more slowly by delaying reward when 
his speed exceeds a given level. Daven- 
port (1962) has extended this type of 
work by correlating the amount of re- 
ward with delay (e.g., large reward with 
long delay versus small reward with 
short delay) producing equal reinforce- 
ment value contours, that is, there are 
critical delay values at which the two 
response strengths for two different re- 
ward magnitudes are equal. A unit in- 
crease in delay can be balanced by a 
log-unit increase in magnitude. 

Studies which pit one major variable 
against another, or in Logan’s (1960, p. 
69) terms, negatively correlated rein- 
forcement, are of theoretical and em- 
pirical importance, for the information 
provided about the relative effects on 
performance of manipulations of such 
major variables as delay, deprivation, 
and amount of reward, should lead to a 
clarification of their corresponding in- 
tervening variables and of behavior 
theory in general. It is perhaps too 
early to draw any conclusions, but it 
seems apparent that delay of reinforce- 
ment is one of the relevant variables 
that will contribute to this approach, and 
a logical extension of Logan’s work is 
to study choice behavior (e.g., Daven- 


port, 1962; Pubols, 1962). 


Delay of Reinforcement and Extinction 


Delay of re 
weaker response 
acquisition meas 


ward is associated with a 


strength as long as an 
ure is used; however, 
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inction measure is utilized, 
n pte poe some conditions, the 
ee statement can be made that 
delay of reward is associated TEA 
stronger response strength. The initia! 
observation (Crum, Brown, & Bitter- 
man, 1951) of the increased resistance 
to extinction following partial delay of 
reinforcement has stimulated a series of 
experiments and raised enough problems 
to make any simple explanations of the 
effects of delay of reinforcement unlikely 
and has linked delay of reward to 
theories of partial reinforcement, 

The study by Crum et al. (1951) 
has been confirmed in a reversal task 
(Kintsch & Wike, 1957) and in runway 
studies (e.g., Logan, Beier, & Kincaid, 
1956; Peterson, 1956; Scott & Wike, 
1956; Wike & McNamara, 1957a). Two 
failures of confirmation are reported 


ay of reinforcement 


was performing 


all groups were considerably below 
asymptotic level. There 


dence to conclude that at least under 
conditions where the delay interval js 
relatively long (e.g., 30 seconds) and js 
present on both acquisition and extinc- 
tion trials, partial or variable delay of 
reinforcement increases resistance to ex- 
tinction, although the exact role of such 
complicating factors as patterning ef- 
fects, trial distribution, cues, secondary 
reinforcement, and delay-produced frus- 
tration are far from being understood. 
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But, even more fundamentally, it is not 
clear if the crucial factor in producing 
this resistance to extinction is the delay 
per se, or if it is the variability, that is, 
the fact that the delay is present on only 
part of the trials. The few studies which 
used constant delay have yielded incon- 
sistent results. Fehrer (1956) using a 
runway, and Pubols (1958) using a dis- 
crimination reversal problem, found that 
constant delay of reinforcement alone is 
sufficient to increase resistance to extinc- 
tion; however, Logan et al. (1956; Lo- 
gan, 1960) have obtained negative re- 
sults and Renner (1963) inconclusive 
results. Thus, in the few studies employ- 
ing constant delay of reinforcement, it is 
not clear if delay alone is sufficient to 
produce resistance to extinction. More 
experimental work is needed to first dem- 
onstrate conclusively if delay of rein 
forcement during acquisition is sufficient 
to produce resistance to extinction, or if 
it is the partial or variable aspect that is 
necessary, and second, to explain what 
the crucial factors are in producing the 
apparent contradictions reported thus 
far. 

The diverse experimental findings on 
extinction have made it, as a general 
phenomena, difficult for any general 
theory to handle (see Kimble, 1961); 
and just as Partial reinforcement has 
Provided a dilemma, so possibly may 
delay of reinforcement present some 
unique problems, Logan (1960) con- 
siders the problems of delay and partial 
reinforcement as essentially identical, for 
Partial reinforcement is the extreme case 
of simultaneously varied delay and 
amount of reward with a perfect posi- 
tive correlation, “Partial reinforcement 
as the limiting case of increasing the 
Tange of variation in reward (e.g., delay 
roduces the greatest re- 
inction [Logan, 1960, p- 
the greater the variation 
variable) delay, the more 


195].” Thus, 
in partial (or 
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re it is to the limiting case of par- 
ial reinforcement and the greater the re- 
sistance to extinction; the data essen- 
tially support this prediction (Kintsch 
& Wike, 1957; Logan, 1960; Logan et 
al., 1956) and the prediction is consist- 
ent with the literature on partial delay 
(e.g., Crum et al., 1951; Scott & Wike, 
1956; Wike & McNamara, 1957a) in 
which a variable 0-30 seconds delay 
pattern is necessary in order to produce 
the greater resistance to extinction. Re- 
call, 30 seconds has been considered as 
the maximum limit of the temporal gra- 
dient (Hull, 1952). If Logan’s (1960) 
position can be supported, then the delay 
problem has been translated into the old 
problem of partial reinforcement or vice 
versa. However, two comments are 
relevant: 

First, additional study of constant de- 
lay of reinforcement is needed, for if this 
too retards extinction (Pubols, 1958), 
then there is a strong case for an addi- 
tional paradox where by delay itself re- 
tards the growth of a habit but results in 
slower extinction; if this is so then delay 
must be treated as a unique case rather 
than as a mere variation of partial rein- 
forcement. 

Second, the extinction studies on pat- 
tial delay of reinforcement were pri- 
marily concerned with furthering an 
understanding of partial reinforcement 
and extinction. Several studies were mi- 
tiated to test the Hull-Sheffield general- 
ized-decrement hypothesis (eg., Crum et 
al., 1951), or interpreted in terms of the 
Mowrer-Jones discrimination hypothe- 
sis (eg, Fehrer, 1956), O" an. tere 
of extinction of the occurrence of rg to 
the cues of nonreinforcement (Logan, 
1960), or in terms of frustration theory 
(e.g., Renner, 1963). Most of the litera- 
ture concerning the effects of delay on 
extinction has been interpreted in terms 
of one of these theories, and the study 
of delay of reinforcement may contribute 


to a further testing and modification of 
these theories. 

Thus, delay is relevant to a number of 
theoretical issues, for example, delay 
may cause frustration which in turn may 
effect drive level; it may introduce com- 
peting responses or produce conflict 
through conditioned avoidance, and such 
effects, and/or the delay itself, may in- 
fluence the degree to which cues are 
utilized and the positive (Grice, 1948) 
or negative (Grice & Goldman, 1955) 
secondary reinforcing power of such 
cues; this account by no means exhausts 
the list of potential complexities, but 
does, perhaps, emphasize the point that 
any comprehensive treatment of delay 
has not yet been achieved. Before turn- 
ing to a consideration of additional 
theoretical problems to which the effects 
of delay are relevant, it may be well to 
examine first in greater detail the con- 
sistencies and contradictions between 
constant and partial (or variable) delay 
of reinforcement. 

The major inconsistency is the failure, 
on some occasions, of constant delay to 
yield increased resistance to extinction; 
however, where greater resistance has 
not been found the subjects have been 
removed at the end of the runway with- 
the customary detention (Logan, 
1960, pp. 185-186; Logan et al., 1956) 
in the delay chamber, or delayed in the 
goal box rather than in a delay chamber 
(Wike & Kintsch, 1959). Positive re- 
sults have been reported (Pubols, 1958) 
when such variations did not occur dur- 
ing extinction. Logan (1960, p- 185) in 
qualifying his conclusion that constant 
delay does not increase resistance to ex- 
tinction, noted that resistance to extinc- 
tion may be a function of the delay in- 
terval (detention) during acquisition and 
detention during extinction. Detention 
time on nonreinforced niie o delayed 

i may be highly relevant, as indi- 
m ei by Pubols (1962) who 


out 
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ifferent rates of learning to be 

sa ae of time spent in the delay 
chamber on nonreinforced trials, or as 
directly supported by Peterson (1956) 
who found resistance to extinction a di- 
rect function of time spent in the delay 
chamber, by Kintsch and Wike (1957) 
who found resistance to extinction was 
increased by a greater length in partial 
delay, and by Wike and McNamara 
(1957a) who found partial support for 
the hypothesis that resistance to extinc- 
tion is related to the percentage of de- 
layed trials. Thus, constant delay as well 
as partial delay may produce increased 
resistance to extinction Proportional to 
the length of delay, but, the effect may 
be obscured for constant delay by the 
fact that partial- (or variable-) delay 
groups may reach a terminal acquisition 
level of performance similar to the im- 
mediate group, whereas the constant- 
delay group has a consistently lower 
terminal performance level, and although 
their rate of extinction (slope of curve) 
may be slower than the immediate group, 
no absolute differences are found (Ren- 
ner, 1963; Wike & Kintsch, 1959). In 
Contrast, groups trained under variable 
or partial delay may start extinction at 


e a e same initial point as the 
Immediate group and 


comparisons between i 
forcement, partial (or varied), and con- 
stant delay, but there is some evidence 
that when immediate and constant-delay 
groups are similar, that constan 
increases resistance to extinction (eg., 
Pubols, 1958; his subjects reached a 
similar acquisition criterion, although the 
amount of overlearning was not identi- 
cal). 


t delay 


Theoretical Implications and Problems 
for Research 


As was implied in the last section, 


there are probably insufficient empirical 
data to give a systematic account of de- 
lay of reinforcement that will explain all 
of the acquisition and extinction aspects 
of the phenomena; but there are several 
theoretical positions that are related to 
performance under delay and which pro- 
vide numerous, albeit fragmented, ac- 
counts for the effects of delay of rein- 
forcement, 

Frustration Theory, Frustration 
theory (Amsel, 1958; Amsel & Roussel, 
1952) is able to provide a rather com- 
plete theoretical account of the effects of 
delay. Presumably, the introduction of 
a delay should be frustrating and pro- 
duce conditioned avoidance and cause a 
decrement in performance early but not 
later in learning, as was observed by 
Renner (1963), because late in learning 
the frustration should become condi- 
tioned to running; however, during ex- 
tinction, when the immediate-reward 
subjects encounter nonreward and frus- 
tration for the first time, differential 
rates of extinction should be observed. 
Thus, resistance to extinction under de- 
lay may be dependent upon prior prac- 
tice in performing the instrumental 
response in the presence of delay (frus- 
tration) cues. For example, Davenport 
(1962) observed an inhibitory function 
of delay (presumably a frustration) 
which increases during the early learn- 
ing trials; also Fehrer (1956) noted an 
absence of any distinct emotional re- 
sponse during extinction by subjects that 
had experienced delay, but in other 
animals who had not experienced delay, 
the emotional reaction was present and 
the subjects extinguished faster, 

Holder et al. (1957) found that an 
intramaze delay decreased runway speed 
before the delay interval and increased 
speed in the segment after the delay. 
Presumably, the frustration and condi- 
tioned avoidance (Amsel, 1958) de- 
pressed initial runway performance, but 
the increased drive resulting from the 
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frustration increased subsequent per- 
formance, However, this confirmation of 
frustration theory by Holder et al. 
(1957) was not replicated in a similar 
study by Wist (1962). 

Delay and Cue Utilization. As was 
noted previously, delay of reinforcement 
apparently can act to facilitate the use 
of cues. Perhaps the important effect of 
delay (detention) with respect to in- 
creased resistance to extinction is to in- 
crease the breadth of learning (Ester- 
brook, 1958; Kausler & Trapp, 1960) 
and facilitate the possible role that cues 
may serve. Distinctive delay-associated 
cues which initially may become condi- 
tioned to the negative effects of frustra- 
tion later may be counter conditioned so 
that greater tolerance for responding in 
the presence of frustration-associated 
cues is acquired, thus producing an ini- 
tial performance decrement but later 
contributing to resistance to extinction. 
This interpretation would mean that 
differential resistance to extinction be- 
tween delay and nondelay groups would 
be a joint function of: greater tolerance 
for nonreinforcement for the delay 
groups and frustration and avoidance by 
the nondelay group who encounter non- 
reinforcement for the first time during 
extinction, Whereas, distinctive goal- 
associated cues, that is, presumably time- 
bridging secondary reinforcers, may 
facilitate or make possible extended de- 
layed-reward learning and, as secondary 
reinforcers present during extinction 
trials, increase the resistance of the re- 
sponse to extinction, although again the 
delay interval may be necessary to acti- 


vate the cues. 

The important role of c 
lustrated by the Perkins (1947) study 
in which interchanging of presumably 
identical delay boxes significantly inter- 
fered with the solution of a maze prob- 
lem under conditions of delayed ren- 
forcement. Thus, detention time, cue 
utilization, and the possible reinforcing 


ues is well il- 
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(Peterson, 1956) or inhibiting effects 
(Grice & Goldman, 1955) of such cues 
may be expected to provide some com- 
plex and puzzling interactions. 
pate r 
a ory in its traditional 
sense is probably an incomplete explana- 
tion there is ample evidence that op- 
portunity for competing responses 
(Spence, 1956, pp. 163-164) exists un- 
der delay conditions. In fact, one study 
(Pubols, 1958) provided an analysis of 
the acquisition and extinction data in 
terms of competing responses. The pos- 
sibility exists that resistance to extinc- 
tion may vary with the number of 
competing responses that have been ex- 
tinguished over acquisition trials and 
must be readapted in order to break 
down the habit during extinction. Thus, 
some systematic variation in the oppor- 
tunity for such responses may be corre- 
lated with slower acquisition and greater 
resistance to extinction. Unfortunately, 
there is no study of the relationship be- 
tween a range of possible competing re- 
sponses (as one type of interference) 
and resistance to extinction. It does not 
seem likely that a response competition 
theory is sufficient, but, as was noted in 
the discussion of frustration, which is 
also a brand of interference theory, the 
data (eg, Davenport, 1962; 
Renner, 1963) warrant the conclusion 
that delay does have inhibitory functions 
that accumulate with trials, especially 
early in learning (recall the early ob- 
servations of Roberts, 1930). k 
The conclusions reached to this point 
may be summarized as: (a) Constant 
delay of reinforcement will retard ac- 
quisition. (b) Partial or variable delay 
may or may not limit acquisition-re- 
sponse strength depending on the experi- 
that is, number of 


empirical 


mental conditions, 
trials, percentage of delayed reinforce- 
ments, and length of delay intervals. 

tard resistance 


(c) Constant delay may re 
to extinction, providing: Cues are pres- 
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ent to be “activated” by delay ; sa 
cues act as secondary reinforcement ‘or 
the response and/or making the response 
to frustration associated cues is learned; 
the detention during acquisition and ex- 
tinction is similar so as not to aid the 
discrimination between acquisition and 
extinction conditions; that appropriate 
comparisons can be made, that is, delay 
and immediate groups have reached a 
similar criterion so that there is sufficient 
opportunity for the extinction curves to 
diverge at some point above no per- 
formance. (d) Variable delay will lead 
to increased resistance to extinction the 
longer the maximum delay and possibly 
the higher the percentage of delayed 
trials, and is essentially the same phe- 
nomenon as partial reinforcement. (e) 
The effects of delay combine additively 
with such other variables as drive and 
amount of reinforcement. 


ADDITIONAL TEMPORAL ASPECTS OF 
REINFORCEMENT 


Delay of Punishment 


The effectiveness of delayed punish- 
ment versus delayed reward has not been 
explored. Kimble (1961) reports only 
three studies (Bevan & Dukes, 1955; 
Kamin, 1959; Warden & Diamond, 
1931) on the delay of punishment gradi- 
ent and comments that it “would be 
interesting to know whether this gradient 
is approximately the same for the situa- 
tion in which the punished response was 
established with a Positive reinforcer 
[p. 155].” Although Kimble’s question 
cannot be answered, there are a few gen- 

eral statements that can be made2 Two 
of the three studies obtained a tempo- 
ral gradient similar to ones obtaineq for 


2 Delay of reward and delay of punishment 
experiments which use the same apparatus and 
require an identical instrumental position re- 
sponse currently are being conducted. The ac- 
quisition and extinction of these responses, 
established under varying drive and cue condi- 
tions, should yield further information on the 
delay of reward and delay of punishment 
gradients. 
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food reinforcement. Kamin (1959) ob- 
tained a temporal gradient by punishing 
an avoidance response after varying in- 
tervals of delay; the inhibitory effects 
of the punishment declined as a function 
of delay and reached an asymptotic level 
at 40 seconds. The slope of this temporal 
gradient of inhibition was more or less 
comparable with reward established gra- 
dients, but not with the flat inhibitory 
gradient reported by Bevan and Dukes 
(1955), which was produced by punish- 
ing the incorrect choice in a Y maze. 
However, Bevan and Dukes’ (1955) 
study and the early study by Warden 
and Diamond (1931) involved the use 
of food reward; thus, no uncomplicated 
direct comparison between delay-of-re- 
ward and delay-of-punishment gradients 
can be made. The only apparent “pure 
delay-of-punishment gradient in which 
the punishment is used to establish an 
instrumental response is the unpublished 
study of Butler (1959) which confirms 
the earlier studies by yielding gradients 
similar to those obtained by reward. 

Additional evidence on the nature of 
the temporal gradient of punishment 
can be found in a study by Azrin (1956) 
or in the pair of experiments by Kamin 
(1957a, 1957b) ; these later studies dem- 
onstrated a gradient of delay of second- 
ary reward (delay in CS termination) on 
avoidance learning. 

The apparent parallel between delay 
of reward and delay of punishment is 
highlighted by the Crum et al. (1951) 
study which showed that varied delay 
of punishment leads to greater resistance 
to extinction than immediate punish- 
ment. Thus far there have been n° 
systematic studies on extinction of in- 
strumental responses established under 
delay of punishment. 

Sidman (1954) indirectly demon- 
strated a delay-of-punishment gradient; 
his subject could postpone the occu! 
rence of a brief shock by either a con- 
stant or a variable time interval, depend- 
ing on which of two bars was presseC: 
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Small time differences between the bars 
had an effect on the response when the 
delay interval was short, but when the 
delay is long no learning can occur with- 
out a larger time difference between the 
bars. 

There are many unknowns with re- 
spect to the temporal gradient of punish- 
ment. The few studies available sug- 
gest a similarity with reward data, but 
it is doubtful if, as empirical data accu- 
mulate, it will be more than just a super- 
ficial similarity. For example, there are 
indications in Bevan and Dukes’ (1955) 
experiment that shock may produce 
anxiety that disrupts the gradient. A 
further confusing note is the finding 
(Knapp, Kause, & Perkins, 1959) when 
cues are very distinct and a delay in 
food reward is constant regardless of 
choice, a subject will prefer immediate 
shock and then a delay interval in prefer- 
ence to delayed shock (delay of 30 sec- 
onds and then shock) because the 
distinct differential delay stimuli acquire 
conditioned fear arousal and the im- 
mediate occurrence of the shock permits 
preparatory responses which may de- 
crease its intensity. This interpretation 
of their data requires that distinctive 
differential stimuli are present; by way 
of contrast, Butler’s (1959) study- of- 
fered subjects a choice between immedi- 
ate or delayed punishment, and the 
Preference was clearly for delayed pun- 
ishment, but his situation had few if 
any distinctive cues, whereas the Knapp 
et al. (1959) study had many cues, and 
in addition at the end of the delay, pro- 
vided food reward in order to elicit the 
choice response in the first place. This 
Procedure limits the range of shock that 
can be used. 


Delay of Reinforcement with Human 
Subjects 


Delay of reinforcement largely has 
been confined to the study of animals, 
specially rats, and it is only recently 

at a number of studies with human 


subjects have appeared. The studies are 
only roughly comparable in that many 
of them use delayed feedback of knowl- 
edge of results on the learning of some 
rote skill rather than delay of a primary 
reward. A further problem is the role of 
verbal cues which are out of the experi- 
menter’s control, but perhaps very in- 
fluential. The important comparative 
question is whether the basic generaliza- 
tions available from animal experiments 
also apply to human learning situations. 
Specifically, does delay retard acquisi- 
tion, does it influence retention or ex- 
tinction, and what role do “cues” play 
in changing the acquisition or extinction 
(forgetting) of responses learned under 
delay of reinforcement? 

The majority of studies have been 
concerned with the effect of a delay of 
knowledge of results on performance of 
a simple motor task and with one excep- 
tion (Greenspoon & Foreman, 1956) 
have yielded negative results, that is, 
delay of knowledge of results does not 
interfere with performance (Bilodeau & 
Bilodeau, 1958; Bilodeau & Ryan, 1960; 
Denny, Allard, Hall, & Rokeach, 1960; 
McGuigan, 1959; Noble & Alcock, 1958; 
Saltzman, Kanfer, & Greenspoon, 1955); 
one additional study had inconclusive 
results (Ryan & Bilodeau, 1962). These 
studies have used delays that ranged up 
to 7 days (Bilodeau & Bilodeau, 1958; 
Ryan & Bilodeau, 1962) although the 
as from 0 to 30 seconds. 
Jay of knowl- 
form- 


ance decrement, an 
deau and Bilodeau (1 y 
et al. (1960) demonstrate that the oe 
cial delay is the time between — : 
(intertrial interval) and not f . be 
between response and feedback; ar 
only long interes esi resp 
aging to performance. a 

ne periments where some pace 
or memory is involved, gto e 
contradictory- Saltzman O ace 
Bourne (1957) reported P 
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nts associated with increasing 
an knowledge of results; however, 
Brackbill and Kappy (1962) using a 
serial-learning task reported delay had 
no effect (the delay intervals in these 
studies were all comparable). The es- 
sential difference was attributed by 
Brackbill and Kappy (1962) to the 
extent to which response-produced cues 
operate in the situation. They reasoned 
that delay has no effect when cues are 
present, but if the situation is such that 
the subject is responding without cues in 
a random trial-and-error fashion, then 
delay of reward (or knowledge of re- 
sults) will decrease performance; when 
the subject is able to link the conse- 
quences with the response, delay has no 
effect. Brackbill and Kappy’s (1962) 
hypothesis may be evaluated by con- 
sidering two other lines of evidence. 
First, in a series of experiments with 
children in a simple response or discrimi- 
nation problem it was found that, similar 
to rat studies, children would prefer im- 
mediate reward and respond faster to 
immediate reward (Lipsitt & Castaneda, 
1958; Lipsitt, Castaneda, & Kemble, 
1959) and that delay of reinforcement 
would impede acquisition of a discrimi- 
nation response (Hockman & Lipsitt, 
1961). However, when subjects were 
instructed to pay attention to the goal 
(goal-directed instruction), delay of re- 
inforcement on a discrimination task did 
not produce an effect (Erickson & Lip- 
sitt, 1960). The implication, 
Brackbill and Kappy’s (1962) hypoth- 
esis, is that the delay interval has no 
effect when the subject can mediate the 
temporal interval by some cue, for 
example, a verbalization (Brackbil] & 
Kappy, 1962), a motor response (eg., 
Bilodeau & Bilodeau, 1958), or a goal 
orientation (Erickson & Lipsitt, 1960), 
In terms of the rat literature of the pre- 
vious section, cues or secondary reinforc- 
ers control the slope of the empirical 
temporal gradient of reinforcement. 
Second, although the argument of 


following 
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Brackbill and Kappy (1962) appears to 
make sense for the typically short (0-30 
seconds) delay intervals that have been 
used, the long 7-day delay interval re- 
ported by Bilodeau and Bilodeau (1958) 
would seem to be too long an interval to 
be bridged without decay by the mere 
presence of a cue. However, the re- 
sponse used (drawing a 3-inch line; in- 
formation on accuracy was delayed 7 
days) was based on a unit, the inch, 
familiar to the experience of all subjects; 
when unfamiliar units are used (Denny 
et al, 1960) the task is made more 
difficult and accuracy over time is re- 
duced. The extended interval appears 
to be an artifact of practice with the 
response. Again the crucial delay in 
accounting for performance decrement 
was the delay between successive re- 
sponses and not delay in knowledge of 
results. When the crucial intertrial in- 
terval is taken into account, delay has 
no effect on performance, at least on the 
simple motor responses studied. Brack- 
bill and Kappy (1962) argue that the 
effect of delay depends on the absence of 
cues. This is, in some ways, a theoretical 
return to the Spence (1947) theory and 
is perhaps suitable for the limited human 
data available. ; 
However, Brackbill and Kappy’s 
(1962) study deserves attention in an- 
other respect, for they also measure 
retention, which may be considered in 
some ways analogous to extinction meas- 
ures, Subjects were tested for recogni- 
tion and for savings after first learning 
a serial list under various delays of 1°- 
inforcement. The savings method pre- 
sumably measured the “persistence” © 
the responses learned under varying de” 
lays. Delay did not interfere with ac 
quisition but did facilitate retentio” 
measures the next day (but not 8 day 
later). Thus a constant delay during 
acquisition was associated with greate" 
resistance to forgetting. Although "° 
traditional extinction measure was take”: 
Brackbill and Kappy (1962) compare 
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their results to the animal work on ex- 
tinction. They concluded: (a) Rats do 
not use cues and thus delay retards 
acquisition. (b) With rats, variable 
delay, but not constant delay, increases 
resistance to extinction; however, par- 
tial or variable delay is a form of partial 
reinforcement and belongs with that 
literature. By implication, their argu- 
ment is that cues eliminate the perform- 
ance decrement produced by delay and 
the utilization of cues during acquisition 
(as in their experiment) is what increases 
retention or resistance to extinction. 
Their analysis requires comment. In 
many respects it is similar to the con- 
clusion presented earlier based on the 
rat data; that is, delay (detention) facil- 
itates cue utilization and cues may in- 
fluence extinction since resistance to 
extinction appears to be related to deten- 
tion time and opportunity to utilize the 
cues, The problem is probably not as 
simple as Brackbill and Kappy (1962) 
imply, for as has been indicated, cues 
may have several effects besides pro- 
ducing resistance to extinction or slower 
forgetting and may even elicit competing 
responses (Rieber, 1961). Further, con- 
trary to their analysis, response-produced 
cues are available to rats which affects 
the slope of the temporal gradient as was 
demonstrated by Grice (1948). The ef- 
fect of such response-produced cues (or 
external cues) on the extinction of a re- 
sponse learned under delay has yet to be 
systematically demonstrated in humans 
or rats, Finally, it is too early to con- 
clude as Brackbill and Kappy (1962) do 
that with animal subjects delay has no 
effect on resistance to extinction (@-8+ 


Fehrer, 1956; Pubols, 1958). 
Delay of Reinforcement and Abnormal 
Behavior 


Mowrer and Ullma 
first to give systematic att 
relevance of delay of rein 
abnormal and neurotic beh 


n (1945) were the 
ention to the 
forcement to 
avior. They 
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noted that neurotics often persist in mak- 
ing responses which lead to punishment. 
The resistance of symptoms to extinction 
as a result of nonreward or punishment 
Jed these authors to label the situation 
the “neurotic paradox” which aptly char- 
acterizes symptoms from the stand- 
point of reinforcement theory. Mowrer 
and Ullman (1945) went on to demon- 
strate how this paradoxical element could 
be explained by the temporal gradient of 
reinforcement by assuming that a symp- 
tom leads to some immediate reinforce- 
ment (e.g., anxiety reduction or second- 
ary gain) and punishment after a delay 
(e.g., isolation, criticism, etc.). If the 
punishment were delayed, even for a 
short interval, the sharp decline of the 
temporal gradient could lead to a situa- 
tion where the net difference could be 
reinforcing to the neurotic response, even 
though in absolute terms the punishment 
might have been severe enough to inhibit 
the response if it had been administered 
immediately. Mowrer and Ullman 
(1945) presented a demonstration rat 
study to support their theory. Thus, the 
arodoxical element was eliminated and 


p 
made consistent with reinforcement 


theory. The only apparent experimental 
test with human subjects of the Mowrer- 
Ullman (1945) theory led to inconclu- 
sive results (Freihage, 1957). 

Dollard and Miller (1950) utilized 
the gradient to explain why punished re- 
at were preceded by some 
reward would persist, but felt the re- 


verse was still paradoxical to reinforce- 
ment theory. They wrote “where the 
s delayed the princi- 


increase in misery i t 
ple of reinforcement can easily handle 
the problem; where an apparent increase 


in misery is immediate a clear dilemma 
is naei Ei 187].” Renner (1961) has 
tentatively indicated how the Mowret- 
Ullman (1945) model could be extended 
to cover a wider range of abnormal be- 
havior including the case of immediate 
punishment and delayed reward. 


sponses th 
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Preference for Delay of Reinforcement 
as a Personality Variable 


t series of experiments has 
Pay ability of children to toler- 
ate a delay of reinforcement. The pro- 
cedure provides the child with a choice 
between an immediate-smaller reward 
versus a delayed-larger reward, the im- 
plication being that the ability to post- 
pone gratification is a sign of maturity 
and reality contact (Mischel & Metzner, 
1962); whereas preference for immedi- 
ate-smaller gratification may indicate a 
more self-indulgent, impulsive, and im- 
mature personality. 

Mischel in a series of experiments 
(Mischel, 1958, 196la, 1961b, 1961c; 
Mischel & Metzner, 1962) has found 
preference for larger-delayed reinforce- 
ment to be associated with such variables 
as father in the home (1958), age (1958, 
1962), social responsibility (1961c, 
1961a), accuracy in time concepts and 
time perspective (1961c, 1962), need 
achievement (1961a), and intelligence 
(1962), Consistent results on time per- 
spective and intelligence, but not age, 
Were also found by Melikian (1959). 

The central concept is that the ability 
to delay gratification is acquired with 
age as a function of experience and that 
this capacity has specific personality 
correlates that are essentially equivalent 
to the level of social maturity. Two as- 
pects of Mischel’s work are relevant to 
this paper: (a) that the ability to accept 
delay of reward may be an acquired 
ability that changes with age and with 
experience with reinforcing agents (also, 

Mahrer, 1956), and (b) that delay of 
gratification may be related to abnormal 
behavior through acquired Personality 
mechanisms that render delayed conse- 
quences ineffective for the modification 
of behavior. From this point of view it 
is important to know not only how delay 
may be related to a learning paradigm 
of neurotic behavior (e.g., Mowrer & 
Ullman, 1945), but how particular learn- 


ing experiences or expectancies in the 
Rotter (1954) sense can develop which 
influence the nature of the temporal 
gradient of reinforcement. The last com- 
ment indicates still another reason why 
data on a learning-how-to-learn-under- 
delay process (discussed earlier) would 
be useful, although none are currently 
available. 


Concluding Remarks 


The topics covered have been too 
numerous to attempt a complete sum- 
mary, but several features are worth re- 
stating. It appears that delay during 
acquisition limits the strength of a re- 
sponse as long as some learning measure 
is used; however, when resistance to ex- 
tinction is used as the measure then the 
effect appears to be similar to the partial 
reinforcement effect. The relationship 
between constant delay, partial (or 
variable) delay, and partial reinforce- 
ment is not clear; but the role of cues, 
both external and response-produced, ap- 
pears to be important and complex. It 
was suggested that delay facilitates the 
utilization of cues but that the cues may 
have varied effects, depending upon the 
experimental conditions, The data on 
human delay of reward learning tend to 
agree in essence with the data from 
animal studies, Despite the long history 
of delay of reinforcement the vast ma- 
jority of work has been with reward and 
there is little data on punishment; the 
data that exist suggests similar temporal 
gradients of punishment and reward. 
More experimental work is needed 10 
order to clarify the effects of delay of re 
inforcement. i 

An interesting, and potentially signif- 
icant, application of the delay-of-rein- 
forcement literature to abnormal per 
havior and Personality characteristics 
has been illustrated by several studies» 


but a realization of this application 1°- 
mains to be done, 
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THEORY: 


A CASE STUDY IN THE HISTORY OF ETHOLOGY ! 
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Behaviors occurring out of their characteristic motivational context and 
in the midst of conflict and thwarting situations have been called “dis- 
placement activities,” Such behaviors were originally explained by refer- 
ence to energy models of motivation, It is now known that the nature 
and intensity of such activities are Primarily a function of 3 sets of 


stereotyped motor sequences elicited by 
certain externa] stimuli. 
building 
(Hinde, 1958) invo} 


Thus nest 


. uences: gath- 
Fee pains: and building All of 


these behaviors are obviously elicited by 
specific stimuli: gathering, by the pres- 


1 This paper is based, in Part, upon discus- 
sions with various members of the Depart- 
ment of Zoology, Subdepartment of Animal 
Behavior, Cambridge University, during the 
author’s tenure as a Postdoctoral Research 
Fellow, United State Public Health Service, 
1958-60. Preparation of the manuscript was 
supported by Grant M-5214, National Insti- 
tute of Mental Health, United States Public 
Health Service. For their detailed comments 
and criticisms of the manuscript, the author is 
indebted to R. A. Hinde and Hugh Rowell. 
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ence of grass and/or feathers; carrying, 
by stimuli in the bill; building, by fea- 
tures of both the material and the nest 
site. Such sensorimotor sequences are 
characteristic of al] species-specific be- 
haviors and the problems they raise are 
primarily those of the morphology and 
developmental history of such stimulus- 
response relationships. 

It was to the latter problem that the 
early American behaviorists addressed 
themselves primarily, and current re- 
search on the effects of early experience 
upon adult behavior (Beach & Jaynes, 
1954) isa direct descendant of the types 
of investigations carried out by Kuo 
(1932), Cruze (1935), and Carmichael 
(1936). These studies have had as their 
point of reference the perennial problem 
of the relative contributions of innate 
and acquired factors to the behavioral 
repertoire of organisms. Accordingly, 
they have been aimed at elucidating the 
developmental history of certain of the 
behavior Patterns which are to be found 
in the repertoire of adult animals. In 
so doing, they have contributed notably 
to the solution of the first of the three 
groups of problems raised by species- 
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eter erie 
wet Baie nes es vo! yes in the artifi- 
the aeii fe ween the innate and 

ebb, 1953; Schneirla, 
1956). 

However, even when the develop- 
mental history of a given sensorimotor 
sequence has been sufficiently clarified, 
certain diverse characteristics of species- 
specific behavior confront us with a 
second group of problems. Consider 
once again the example of nest building. 
Even though the sensorimotor mecha- 
nisms involved in this behavior are pre- 
sumably available throughout most of 
the bird’s adult life, nest building takes 
place only at certain periods in the bird’s 
life cycle. During these periods the bird 
is differentially responsive to stimuli 
which may have been consistently pres- 
ent in its environment, for example, 
grass and feathers, but which up to this 
time have been ignored. Such periodic 
or seasonal variations in the responsive- 
ness of sensorimotor mechanisms are 
characteristic of many species-specific 
behaviors. Witness seasonal migrations 
in birds and fish, or the variations in 
sexual receptivity of female rats during 
the estrus cycle. Moreover, in addition 
to such relatively long-term variations, 
short-term variations in responsiveness 
are also common; that is, a constant 
stimulus presented at intervals to an 
animal does not elicit a constant re- 
sponse. The most obvious examples are 
to be found in connection with feeding 
and mating behavior. The animal that 
has eaten is no longer responsive to 
food, and sexual responsiveness 10 the 
male is considerably reduced following 
copulation. Similarly, feathers and grass 
may be available continuously, but nest 
building occurs in discrete bouts which 
alternate with other activities. The oc- 
currence of nest building appears to a 
duce temporarily the responsiveness (0) 
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the sensorimotor mechanisms involved 
in the behavior. 

The third set of problems raised by 
species-specific behavior is best illus- 
trated by the random exploratory be- 
havior of a hungry animal, or the activity 
of an animal engaged in nest-site selec- 
tion. Such animals appear to be reacting 
to the deprivation of some stimuli, and 
for this reason Lashley characterized 
such behaviors as reactions to a deficit. 
Thus when an animal builds a nest it 
is clearly reacting to specific stimuli, 
but nest-site selection and the fact that 
the nest is built to a definite form sug- 
gests, in addition, reactions to a deficit. 
Similarly, at a certain stage of nest 
building, the canary stops gathering 
grass and begins to collect feathers 
which are used for the soft, inner lining 
of the nest. There are no obvious ex- 
ternal stimuli which are adequate to ac- 
count for the onset of nest building, the 
changeover from grass to feathers, or the 
termination of the nest with the achieve- 
ment of a definite species-specific form. 

The problems raised by variations in 
responsiveness and reactions to a deficit, 
because they relate to the arousal, direc- 
and termination of behavior pat- 
an to their developmental 
history, are often grouped under the 
rubric of motivational problems. Etho- 
logical theories of motivation represented 
an attempt to deal systematically with 


such problems. 


tion, 
terns rather th 


Models, and 


Drive Theory, Energy 
Motivation 


In endeavoring 
vational aspects 0 

cies-specific—man 
Pond it necessary tO postulate _the 
existence of either single- or multiple- 
drive processes- In either case, the drive 
is viewed as activating specific behavior 
patterns: different degrees of activation 
resulting in increases or decreases in the 


to deal with the moti- 
f pehavior—learned or 
y theorists have 
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probability of eae of their cor- 
related behavior wes re 
Implicit in many of these : Pane 
he concept of . motivational energy, 
t hich is often treated as analogous to 
pete comparable with physical, chemical, 
or electrical energy. While most con- 
temporary theories of motivation have 
been variants of drive theory, for certain 
theorists, for example, Hull, Hebb, 
Spence, etc., the energy analogy is 
merely implicit and is represented by 
a general-drive variable whose function 
it is to energize behavior. For other 
theorists, for example, Freud, McDou- 
gall, Lorenz, and Tinbergen the central- 
ity of the energy concept is indicated by 
the use of such terms as libido, psycho- 
physical energy, reaction-specific energy, 
motivational impulses, etc, These latter 
theorists, moreover, have made the en- 
ergy analogy still more explicit by em- 
bodying it in electrical, mechanical, or 
hydraulic models, In these energy 
models of motivation (Hinde, 1960), 
variations in the quantity, quality, or 
distribution of such energy are used to 
account for those observed variations in 
the behavior of anima 
grouped under th 
changes, 


Lorenz’s theory, 


ls customarily 
e rubric of motivational 


for example, was an 
attempt to account for the short-term 


variations in responsiveness typical of 
species-specific behavior, His observa- 
tions that sensorimotor sequences þe- 
come progressively more difficult to elicit 
with repeated presentations of the sign 
stimulus led him to suggest that these 
variations in responsiveness reflected 
variations in the level of accumulated ac- 
tion-specific energy in some hypothetical 
reservoir within the animal (Lorenz, 
1950). 

While such concepts might account 
for the motivational aspects of rather 
limited behavior patterns (consumma- 
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tory behaviors), they were patently in- 
adequate to deal with either long-term 
or with periodic variations in responsive- 
ness or with the problems raised by appe- 
titive behaviors—those behaviors which 
Lashley had characterized as reactions to 
a deficit. It was to deal with such 
behaviors that Tinbergen (1951) devel- 
oped his hierarchichal model of motiva- 
tion. This model retained Lorenz’s 
concept of action-specific energy but ex- 
tended the concept considerably. Instead 
of postulating motivational energy whose 
action specificity was restricted to a par- 
ticular, limited response pattern, he 
viewed motivational energy as being gen- 
eral to all the activities of a major 
instinct. He thus replaced action-specific 
energy with what might be termed drive- 
specific energy. Thus the same type 
of motivational energy could activate all 
the diverse sensorimotor mechanisms 
grouped by Tinbergen under the rubric 
of the reproductive instinct—mating, 
nest building, aggression, maternal be- 
havior, etc. Each of these subinstincts, 
however, was also activated by motiva- 
tional energy specific to it, which was 
accumulated in its own specific reservoir 
or center, 


The difficulties inherent in the use of 
Such energy models have been percep- 
tively discussed by Lehrman (1953), 
Kennedy (1954), and Hinde (1959, 
1960). Although none of these authors 
Considers displacement activity in detail, 
it is clear from their comments that 
the concept of displacement activity has 
been among the problematic aspects of 
ethological theories of motivation. It 
is equally clear from Hinde’s (1960) 
recent paper that the vicissitudes which 
this concept has undergone indicated the 
necessity for a revision of such theories- 
It is for this reason that the present 


paper is subtitled “A Case Study in the 
History of Ethology.” 
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Displacement Activity 


_ One of the most obvious characteris- 
tics of animal behavior is its directed- 
ness, Any individual pattern of behavior 
normally occurs as part of a sequence of 
functionally related behavior patterns. 
It appears to serve the same ends as do 
the behavior patterns which precede and 
follow it, and to share with them a com- 
mon history of environmental stimula- 
tion. The grooming of birds, for ex- 
ample, normally occurs after bathing, 
when it serves to clean and resettle the 
feathers, It is thus associated with a 
recent history of certain types of en- 
vironmental stimulation, for example, 
water or dirt on the feathers, and we 
should expect it to occur only in its 
proper place in the functional sequence 
and only in the presence of such stimu- 
lation, Should grooming occur in the 
absence of such stimulation and in the 
midst of a sequence of behaviors (e.g.; 
fighting, mating, etc.) which does not 
include bathing, we should be justified in 
considering such grooming behavior 
anomalous, since it appears out of its 
usual behavioral context. 

Tt is thus a matter of some surprise 
and considerable interest to find that 
out-of-context activities are observed 
quite frequently in the behavior of ani- 
mals, Thus, for example, postures and 
movements normally associated with 
feeding, grooming, nest building, or 
sleeping in birds are often observed to 
occur out of context during aggressive 
or sexual encounters with other birds 
(Andrew, 1956a; Baggerman, Baerends, 
Helkins, & Mook, 1956; Hinde, 1952; 
Lorenz, 1941; Makkink, 1931, 1936; 
Moynihan, 1953; Tinbergen, 1940). 
Similar observations have been recorded 
for fish (Tinbergen & van Tersel, 1947) 
and invertebrates (Bastock & Manning, 
1955: Crane, 1957; Gordon, 1955). At 
the mammalian level, out-of-context be- 
havior has been observed in rats (Bolles, 
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1960), cats (Armstrong, 1950), dogs 
(Schmidt, 1956), and wolves (Schenkel, 
1947). Finally, both Lorenz (1950) and 
Tinbergen (1951) have suggested that 
such characteristic human behaviors as 
head scratching and tie straightening in 
emotional situations may be viewed as 
out-of-context activities (for an earlier 
review, see Armstrong, 1950). 

The possible theoretical significance 
of out-of-context activities was first 
noted, independently, by Kortlandt 
(1940) and Tinbergen (1940) who 
pointed out certain peculiarities in the 
behavior patterns themselves or in the 
conditions under which they occurred. 
First, such behaviors appeared to be 
most frequent in situations of either con- 
flict or thwarting. A conflict situation 
may be characterized by the simulta- 
neous activation of two or more incom- 
patible behavior patterns. Such situa- 
tions occur frequently during sexual 
(attack, escape, copulate) or aggres- 
sive encounters (attack, escape). The 
various behavior patterns elicited by the 
situation are incompatible and the ani- 
mal generally alternates among them. 
Thwarting situations arise when a 
specific response pattern is activated but 
the animal is prevented from perform- 
continuing the 


ing, completing, or 
response. A male stickleback, for 
example, cannot perform coition until 


the female follows him to the nest, and 
in the presence of incompletely aroused 
females will often display out-of-context 
nest-ventilation movements. 

The out-of-context behaviors observed 
in conflict and thwarting situations were 
described as being different in kind and 
degree from the same behaviors in their 
normal context. To describe these dif- 
ferences, Tinbergen (1940) used such 
terms as irrelevant, frantic, and incom- 
plete. The term irrelevant implied that 
the stimuli which normally elicit the 
behavior in an appropriate context were 
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hus out-of-context feeding re- 
So take place in the absence 
of food. The terms incomplete and 
frantic referred to the fact that out-of- 
context activities are often broken off in 
the middle, seem hurried, and are ec- 
centric, or imperfectly oriented. For 
example, bill wiping, a form of grooming 
in the zebra finch, is normally performed 
with the bird’s body and head parallel 
to the branch or twig on which it is 
perched. It thus makes contact with 
the branch when the head is lowered and 
the bill wiped. By contrast, out-of-con- 
text bill wiping often takes place with 
the bird’s body at right angles to the 
branch so that the bill wipes in midair 
(Morris, 1954), 

Finally, it was noted that out-of-con- 
text activities occur most frequently 
when the two antagonistic response tend- 
encies are in equilibrium, that is, when 
they are of (presumably) equal strength, 
For example, out-of-context n 
in the stickleback was foun 


van Iersel, 1947), 
scribed above all 


share a common characteristic, Tp every 


case, a specific response tendency has 
been activated but its overt manifesta- 
tion has somehow been blocked; either 
by the presence of an antagonistic re- 
sponse tendency, or by the absence of 
an appropriate stimulus situation, 

On the basis of such observations, 
Tinbergen (1940) and Kortlandt (1940) 
postulated a motivational mechanism de- 
rived from and compatible with the type 
of drive theory current in ethology at 


the time. They suggested that, in the 
conflict situation, the motor patterns 
belonging to one of the drives (e.g, 
escape) are incompatible with the motor 
patterns belonging to the other (e.8-, 
attack) drive. In the thwarting situa- 
tion, they suggested, a given drive can- 
not express itself fully because the re- 
quired releasing stimulus is not present. 
In either case, the result—in terms of 
an energy model—is a build-up of moti- 
vational energy which cannot discharge 
itself through the appropriate motor 
channels because these channels are 
blocked or otherwise unavailable. Tin- 
bergen (1951) therefore suggested that 
such energy finds an outlet by discharg- 
ing through the center of another in- 
stinct (p. 117). Since the center through 
which it discharges may have no func- 
tional relation to the center which is 
blocked, the resultant behavior will 
necessarily be out of context and occur 
in the midst of behavior sequences to 
which it is irrelevant. Out-of-context 
behaviors were thus viewed as resulting 
from a sparking over (Ubersprung) 
of motivational energies from the motor 
system of one instinct to that of another, 
and were termed by Tinbergen Uber- 
Sprungbewegungen (Tinbergen, 1940). 
The English equivalent, displacement 
activities, was suggested by Armstrong 
(1947). Grooming, which is observed 
to occur in the midst of an aggressive 
encounter would be called displacement 
grooming, since it has not been elicited 
by the normal stimuli for grooming nor 
activated by a grooming drive, Rather it 
is viewed as being motivated by energy 
from fleeing and/or fighting centers 
which has sparked over into the groom- 
ing center. As Kortlandt (1940) put it, 
the displacement activity is an expres 
sion not of its own drive (autochthonous 
motivation) but of a strange or different 
drive (allochthonous motivation). 

he phenomenon of displacement ac- 
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tivity had certain immediate implica- 
tions for ethological drive theory. In 
Lorenz’s original formulation of the 
theory, each fixed action pattern (sen- 
sorimotor sequence) in the animal’s rep- 
ertoire was presumably activated from 
a reservoir of motivational energy spe- 
cific to that pattern. The phenomenon of 
displacement activity suggested that sur- 
plus energy, overflowing from a given 
reservoir, could activate behavior pat- 
terns not normally fed from that reser- 
voir, Similarly, Tinbergen had originally 
viewed motivational impulses as being 
drive-specific; since they could activate 
only one hierarchical system of centers 
(e.g., that of the reproductive instinct). 
The discovery that motivational energy 
could spark over from one such hierarchi- 
cal system to another, for example, from 
reproduction to body care, implied the 
existence of general as well as specific 
motivational energy. The theory which 
finally emerged (Tinbergen, 1951) com- 
bined both possibilities and provided 
for both general drive and multiple- or 
specific-drive processes. In this respect 
it was similar both to Freudian theory 
(Colby, 1955) and to much contem- 
porary psychological theorizing. 
Moreover, both Tinbergen (1951) and 
Lorenz (1957) suggested that the pri- 
mary function of displacement activity 
is a cathartic one. It was viewed as per- 
mitting the resolution of conflicts be- 
tween two antagonistic drives by acting 
as an outlet through which surplus moti- 
vational energy could blow off. Several 
writers (Armstrong, 1950; Bastock, 
Morris, & Moynihan, 1953) pointed to 
the obvious similarity between displace- 
ment activity in animals and a variety of 
neurotic and emotional behaviors in man, 
and in animals suffering from experi- 
mental neurosis—regression, object dis- 
Placement, neurotic passivity, etc. Such 
similarities were of particular interest 
for psychoanalytic instinct theories 
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(Barnett, 1955; Rapaport, 1960). Fur- 
thermore, both Lorenz (1957) and Tin- 
bergen (1951) pointed to the presence of 
displacement activities in man as “sug- 
gesting an instinctive organization in 
man basically similar to that found in 
animals [Tinbergen, 1951, p. 210].” 

Finally, there occurred one of these 
processes of circular reasoning only too 
common in the early stages of theory 
development. On the one hand, the neat- 
ness and ease with which the ethological 
energy model accounted for the phe- 
nomenon of displacement activity was 
taken as an indication not only of the 
utility of the model, but of its validity. 
Conversely, the very plausibility of the 
model lent credence to a surplus-energy 
interpretation of displacement activity 
and militated against an explanation in 
more analytic terms. For this reason 
the surplus-energy interpretation pre- 
sented by Tinbergen and Kortland in 
1940 was to go essentially unchallenged 
for more than a decade. 


The Causal Analysis of Displacement 
Activity 

The publication, in the early fifties, 
of comprehensive statements of etho- 
logical theory (Lorenz, 1950; Tin- 
bergen, 1951) elicited a barrage of criti- 
cisms from both psychologists (Lehrman, 
1953) and ethologists (Hinde, 1959; 
Kennedy, 1954). These criticisms were 
instrumental in stimulating more de- 
tailed investigations of the factors under- 
lying displacement activity. The results 
of these investigations cast serious doubt 
upon the motivational uniqueness of dis- 
placement activity. 

It became apparent, for example, that 
under certain circumstances displace- 
ment activities need not be incomplete 
or imperfectly oriented but might be in- 


distinguishable in form or orientation 


from the same behaviors in em 
in the 


Thus displacement grooming 1n 
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Morris, 1954) might in- 
= een sequence of grooming 
ee ae stretching, shak- 
ae scratching—used by the bird K 
normal grooming, while displacemen 
feeding in this species could range from 
incomplete pecking at the floor to actual 
feeding in which the bird swallowed 
food. For additional examples see Arm- 
strong (1950) and Andrew (1956b). ; 

A second distinguishing characteristic 

of displacement activity had been its 
supposed irrelevance. Two assumptions 
appear to underlie the use of this term. 
First, the behavior was assumed to be 
independent of any other ongoing activ- 
ity of the animal, However, it has been 
shown that the occurrence of one dis- 
placement activity rather than another 
in a given situation appears to be related 
to the animal’s posture (Lorenz, 1957; 
Tinbergen, 1952). Secondly, it was as- 
sumed that the stimuli which normally 
elicit the activity were absent. This as- 
sumption too proved to have been un- 
justified, since a large body of evidence 
indicated that the nature of the displace- 
ment activity shown depended upon or 
was strongly influenced by the external 
stimulus situation, For example, dis- 
placement feeding in the turkey (Raber, 
1948) might take the form of either 
eating or drinking depending upon the 
relative availability of food and water, 
Displacement feeding in the great tit 
(Hinde, 1952) might take the form 
either of turning over leaves, or pecking 
at the ground, depending upon whether 
the bird were perched in a tree or on the 
ground. Similarly, in the courtship of 
the zebra finch (Morris, 1954) the bird 
might show displacement feeding, if food 
were available; if a female were avail. 
able (during a fight with another male) 
displacement mounting might take Place, 
that is, mounting without either prelimi- 
nary courtship or succeeding copulation, 
If neither of these alternatives were 
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available, the bird might engage in dis- 
placement grooming or displacement 
sleeping. 

Finally, Morris (1956) and Andrew 
(1956b) pointed out that conflict situa- 
tions in birds and mammals are accom- 
panied by a wide variety of autonomic 
responses including respiratory, circula- 
tory, and thermoregulatory changes. 
These, in turn, would provide a host of 
stimuli arising from the feathers, blood 
vessels, and skin which might be ex- 
pected to elicit a variety of grooming 
activities. 

With the gradual accumulation of such 
observations it was becoming apparent 
that displacement activity is not irrele- 
vant but is clearly causally related to the 
ongoing activities of the animal and/or 
the nature of the external stimulus situa- 
tion. Nevertheless, a careful analysis of 
the motor patterns utilized as displace- 
ment activities indicates that there is 
some sense in which these patterns are 
unique. R 

Of all the motor patterns available in 
the animal’s repertoire, only a limited 
number are ever observed to occur a5 
displacement activities. The reader may 
have already noted that displacement 
grooming, feeding, and sleeping have 
been referred to most frequently. It 1S 
difficult to characterize such patterns 
except to note that they are common oF 
readily available ones for the species in 
question (Lorenz, 1957). Grooming, for 
example, is one of the commonest behav- 
ior patterns of birds and mammals. A 
deed, Bolles (1960) has reported tha 
male rats in their home cages spend 40% 
of their waking hours in grooming behav- 
iors, Similarly, oral activity, both i” 
and out of feeding situations, is commo” 
in birds and mammals, As for relatively 
less-common activities (e.g., nest puild- 
ing, mounting, incubation), these at 
found to occur as displacement activities 
only when their threshold for occurrencë 
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has been temporarily lowered—either by 
internal (seasonal) changes or by the 
presence of external stimuli (eg., a 
mate, a nest, or eggs). In short, the 
motor patterns exhibited most frequently 
as displacement activities in conflict or 
thwarting situations are precisely those 
which, ceteris paribus, are likely to 
occur with the greatest frequency even 
under normal circumstances. 

This generalization has been shown to 
hold not only for complex behavior pat- 
terns such as feeding or grooming, but 
for the individual responses which con- 
stitute these complex behaviors. Groom- 
ing in birds, for example, may consist of 
the following set of responses: head 
shaking, preening, feather settling, bill 
wiping, and scratching. By studying the 
temporal organization of these responses, 
several investigators (Andrew, 1956a; 
Morris, 1954; van Iersel & Bol, 1958) 
have shown that there is a regular se- 
quence of such responses, with one type 
of response regularly following a given 
response and preceding another. Dur- 
ing normal grooming in the bunting, for 
example, bill wiping and feather set- 
tling are seen first, followed by preening 
and finally by scratching. Other things 
being equal, some types of grooming 
responses are more likely to occur during 
han are others. It is 


normal grooming t t 
precisely these low threshold grooming 
responses which are observed to occur 
most frequently as displacement activi- 
ties. 

The sole remaining characteristics of 
displacement activity which require ex- 
planation are the occurrence of the be- 
haviors out of their normal contexts and 
in the midst of situations involving con- 
flicting or thwarted response tendencies. 
Why, in addition, does displacement 
activity in conflict situations occur most 
frequently when the antagonistic re- 
Sponse tendencies appear to be of equal 
Strength? These characteristics consti- 


tuted the crux of the problem of dis- 
placement activity and as the evidence 
began to accumulate several alternatives 
to the original surplus energy hypothesis 
began to emerge. 

In a study of normal and displacement 
grooming in birds, Andrew (1956a) ob- 
served that not only was grooming one 
of the commonest activities in the birds 
repertoire, it was also one of the most 
easily delayed, suppressed, or inter- 
rupted activities. If a bathing bird were 
frightened, for example, it would flee 
in the midst of bathing without respond- 
ing to its wet plumage by a burst of 
grooming activity. Andrew further ob- 
served that when grooming did occur, it 
was most likely to take place at a time 
when the animal was changing over from 
one activity to another—an observation 
since confirmed and extended to the rat 
by Bolles (1960). Thus it might occur 
before and after bursts of locomotion, or 
after a sexual display but before nest- 
building activity. 

On the basis of his observations, 

Andrew (1956a) presented a hypothesis 
which accounted for the occurrence of 
grooming as a displacement activity dur- 
ing conflict situations, He suggested 
that the peripheral stimuli which elicit 
grooming are continually present but 
that grooming is easily suppressed by 
other activities. In conflict situations, 
when a response tendency such as escape, 
which normally takes precedence over 
grooming, is blocked by an equally 
strong competing response tendency such 
as attack, grooming may occur in re- 
sponse to the usual peripheral stimuli. 
There are then two types of causal fac- 
tors underlying displacement grooming 
—direct factors and indirect factors. Di- 
rect factors would include the presence 
of the peripheral stimuli normally ao 
ing the behavior, while we a Ta 
factor was the extent to WMC o 


370 


activities, incompatible with grooming, 
$ 
EA (1959) independently sug- 
gested that displacement activity might 
be explained by reference to three vari- 
ables: arousal level, habit strength, and 
sensory cues. He postulated that ani- 
mals in conflict or thwarting situations 
are characterized by increased levels of 
arousal, and that in this condition the 
activities which are most likely to occur 
are those “which are prepotent in the 
animal’s repertoire.” The specific activ- 
ity which occurs will be determined by 
the nature of the peripheral stimuli 
present. 

A third hypothesis (the disinhibition 
hypothesis) was suggested by van Iersel 
and Bol (1958) and Sevenster (1960, 
1961). It assumed that excitatory fac- 
tors are constantly present in some de- 
gree for all behavior patterns, but that 
inhibitory relationships exist among 
various behavior patterns such that the 
activation of one such pattern sup- 
presses the occurrence of others. In 
conflict situations, the antagonistic re- 
sponse tendencies tend to inhibit each 
other thus decreasing the inhibition each 
alone exerts on other activities, The 
degree of disinhibition, together with the 
extent to which internal factors an 
ternal factors (peripheral stimuli 
present for other activities determines 
which of these activities will be shown as 
a displacement activity. 

All three of these hy 
a considerable advan 
“surplus” hypothesis, They all discard 
the untenable distinction between au- 
tochthonous and allochthonous motiva- 
tion; they recognize the role of the 

stimulus situation and they describe the 
occurrence of displacement activity in 
terms of the probability of occurrence 
of other behavior patterns. However, 
the Bindra (1959) hypothesis involves 
the postulation of an intervening varia- 


d ex- 
) are 


potheses represent 
ce on the original 
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ble of arousal level; a concept whose 
formal properties have frequently been 
analogized to those of a general-drive 
concept (Hebb, 1955; Hinde, eet 
Similarly, both versions of the disinhi- 
bition hypothesis are phrased in terms 
of drives and centers. Moreover, inhibi- 
tion and disinhibition are simply more 
elaborate descriptive terms and their 
use in this context is tantamount to a 
restatement of the observation- that 
grooming occurs more frequently in one 
situation than in another. ie 
The problem of displacement activity 
has been considerably clarified by F 
recent experimental investigation © 
normal and displacement grooming A 
the chaffinch (Rowell, 1961). Rowe 
achieved a considerable degree of con- 
trol of the relevant variables, of yuia 
the most important are the existence i 
a conflict situation and the presence © 
the peripheral stimuli eliciting gom ii 
Such stimulation was produced ae 
by spraying the birds with a fine wat f 
spray, or by using a sticky birdseee 
which produced grooming responses m 
stricted to the bill (bill wiping). T 
kinds of approach-avoidance coni 
situations were used, both of whic 
could be varied in intensity. e 
By varying the amount and/or ar 
of peripheral stimulation, Rowell Ls 
able to affect both the composition (bo y 
grooming versus bill wiping) and a 
quency of grooming responses in bo g 
normal and conflict situations. He re 
cluded that the pattern and intensity BA 
peripheral stimulation are the dire 
causal factors underlying grooming in A 
situations, thus confirming the first P2 
of Andrew’s (1956a) hypothesis. a 
He further demonstrated that the m 
quency of grooming varies inversely WÍ 
the frequency of other activities a 
as locomotion and that grooming is oe 
sible only in the pauses between = 
activities. Furthermore, grooming 
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not occur in all such pauses, but only in 
those above a certain minimum length 
(which is always longer than the dura- 
tion of the actual motor patterns in- 
volved in grooming). Finally, the dif- 
ferent types of grooming responses take 
different lengths of time; the most com- 
mon being of short duration and thus 
least susceptible to interruption by other 
activities, Not only the occurrence but 
also the composition of grooming is in- 
fluenced by the length of time available. 
This confirms another portion of the 
Andrew (1956a) hypothesis which sug- 
gested that the presence or absence of 
behaviors incompatible with grooming 
constitutes an indirect causal factor un- 
derlying grooming. 

One further factor remains to be con- 
sidered. Displacement activity is most 
frequent when the conflicting response 
tendencies are equally balanced, that is, 
when the animal is in a state of motiva- 
tional equilibrium with respect to these 
tendencies. In this situation, grooming 
may be expected to occur in response to 
the usual peripheral stimuli. The exist- 
ence of such an equilibrium state is not 
directly observable but was inferred by 
Rowell from variations in the bird’s 
behavior, Rowell reasoned that if, in an 
approach-avoidance situation, the ani- 
mal pauses in its locomotion and then 
reverses its direction a state of equi- 
librium might be presumed to have been 
reached, since the approach tendency 
has changed into an avoidance tendency 
(or vice versa). If, as Andrew (1956a) 
had hypothesized, the presence of an 
equilibrium state is a major factor per- 
Mitting grooming, grooming should be 
Most frequent in those pauses involving 
changes of direction. This prediction 
Was confirmed in Rowell’s study and, in- 
dependently, in a study by Tugendhat 
(1960) of approach-avoidance conflicts 
in sticklebacks, It therefore appears that 
the equilibrium state is a permissive 
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factor, and that variations in grooming 
behavior in such a situation are due 
simply to variations in the intensity and 
type of peripheral stimulation. 

We are now in a position to account 
for all the observed characteristics of 
displacement grooming and to extend 
our explanation to other types of dis- 
placement activity. Behaviors observed 
to occur as displacement activities are 
those whose threshold for elicitation is 
low at the time, The most common dis- 
placement activities (feeding, grooming) 
are maintenance activities with very low 
thresholds. Other behaviors, for ex- 
ample, sexual and maternal behaviors, 
will occur as displacement activities when 
their thresholds have been reduced by 
the presence of hormones or of external 
stimuli such as nests, eggs, or other 
birds. The occurrence of displacement 
activity in conflict situations is due to 
the fact that such situations provide for 
the existence of equilibrium states which 
are characterized by the temporary ab- 
sence of competing responses and the 
presence of pauses whose duration is 
long enough for the displacement activity 
to occur. Since different responses re- 
quire different minimal lengths of time 
to occur, those which require long times 
are most likely to be interrupted by other 
activities. Thus the appearance of in- 
completeness or of frantic behavior is 
undoubtedly due to the fact that the 
duration of the equilibrium is generally 
too short to permit the complete oc- 
currence of the entire behavior pattern. 
Finally, in thwarting situations the 
stimuli which would elicit normal, higher 
threshold responses are absent and other 
behaviors prepotent to some extent at 
the time may occur in response to their 
normal stimuli. 

Such a causal analys 
activity is a far cry 
conceptualization as @ 
provided cathartic relie 


is of displacement 
from its original 
mechanism which 
f in neurotic con- 
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flicts by discharging surplus motiva- 
tional energy. 


Implications for Motivational T heory 


The original attractiveness of etho- 
logical energy models of motivation was 
due, in part, to the ease with which 
they appeared to account for certain 
diverse motivational phenomena. The 
experimental analysis of instinctive be- 
havior has now progressed to the point 
where alternative explanations of these 
phenomena are possible. Short-term 
variations in responsiveness, for example, 
may frequently be due not to the ex- 
haustion of reaction-specific energy but 
to stimulus-specific habituation proc- 
esses, Or various types of sensory feed- 
back (Bol, 1959; Hinde, 1959; Lehr- 
man, 1956a), Reactions to a deficit may 
frequently be accounted for in terms of 
interactions between species-specific 
nd species-typi- 
cal environments (Lehrman, 1956b; Van 
der Kloot & Williams, 1953a, 1953b) 


he past experi- 


Possible to ac- 
activity without 
ls of motivation, 
» Whether Freud’s, Mc- 
Dougall’s, Lorenz’, or Tinbergen’s, have 

been inextricably linked with drive con- 
structs, so that in discarding the energy 
aspects of the models we raise the pos- 
sibility of discarding the drive constructs 
as well. However, it may be advanta- 
geous at the outset to distinguish þe- 
tween specific- and seneral-drive con- 
structs and deal with them separately, 
The term specific drive often Teflects 

a confusion between functional and 
causal approaches to the analysis of be- 
havior. On the one hand, such concepts 
are used as functional labels to catego- 
rize and describe groups of behavior pat- 
terns which appear to share a sommon 
function. The diverse activities involved 


in nest building, since they eventuate in 
a completed nest, are lumped together 
under the rubric of a nest-building drive 
or under the broader rubric of a maternal 
drive. However, since the activities in- 
volved in nest building appear to fluctu- 
ate together in time it is often ey 
that they must share a common causal 
mechanism. Hence the concept of a 
specific drive is often invoked causally as 
an explanation for the spontaneous = 
currence, temporal organization, an 

goal directedness of the behavior. It 
would obviously be advantageous to 
discard such constructs and replace them 
with explanations of the type exempli- 
fied by the causal analysis of displace- 
ment activity. Fortunately, such analy- 
Ses are rapidly becoming available for 
almost every aspect of species-specific 
behavior including sexual behavior in 
mammals (Beach, 1956), incubation be- 
havior in birds (Baerends, 1959), ma- 
ternal behavior in birds and mammals 
(Lehrman, 1961), nest building in birds 
(Hinde, unpublished *) and cocoon spin- 
ning in silkworms (Van der Kloot & Wil- 
liams, 1953a, 1953b), hunger in mam- 
mals (Miller, 1957; Teitelbaum, 1962) 
and in the blowfly (Dethier & sae 
stein, 1958), exploratory behavior an! 

play (Berlyne, 1960; Welker, 1961). 
The existence of such causal analyses 
has rendered the construct of specific 
drive superfluous, 

Perhaps the most ubiquitous of pe 
Constructs invoked by iotivation 
theorists is that of a general drive whic 
functions solely to energize behavior 
Although such a concept does not forn $ 
an explicit part of either Lorenz’ or T 
bergen’s theory, it was implicit in t 3 
original formulation of the surplus 
hypothesis of displacement activity. Fo 
Psychological theorists the concept ‘1 
a general drive has Occupied a centt 


2 Conference on 


in 
Sex and Behavior held 
1961-62 in Berkele 


Y, California. 
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place in learning theory since Hull, even 
though the behavioral evidence custom- 
arily cited in its support has been fre- 
quently considered of dubious validity 
or doubtful relevance (Bolles, 1958; 
Estes, 1958). The recent emergence of 
the concept of arousal within motiva- 
tional theory represents an extension of 
the general-drive concept to problems 
other than those involved in learning— 
most notably to those of emotional be- 
havior. 

The term arousal was originally used 
in conjunction with physiological studies 
of a so-called arousal or activating sys- 
tem located in the brain stem. With the 


publication of papers by Duffy (1957), 
Lindsley (1951), Hebb (1955), and 
Malmo (1957), the implications of these 
physiological findings were explored by 
psychologists and the concept of arousal 
gained widespread currency (Berlyne, 
1960; Bindra, 1959), As currently em- 
ployed the term is indistinguishable in 
its functional properties from the con- 
cept of an energizing general drive. 
Some theorists have even identified gen- 
eral drive with the activity of the so- 
called nonspecific activating system of 
the brain stem (Hebb, 1958; Lindsley, 
1957). 

Such terms as arousal level and non- 
specific activating system share an 
important feature with other drive con- 
structs in that they have all the ear- 
marks of what may be termed a blanket 
variable. They are invoked to account 
for a variety of diverse effects whose 
only common characteristic may b 
fact that they all fluctuate togeth 
some extent, Moreover, a concept of 
motivational energy is implicit in the 
arousal construct so that it is marred by 
the same conceptual difficulties inherent 
in general drive constructs. These con- 
ceptual difficulties, and certain recent 
developments in psychology and physiol- 


e the 
er to 
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ogy suggest the need for a re-examina- 
tion of the arousal concept. 

The validity of the physiological evi- 
dence upon which the concept rests is 
currently being called into question. A 
recent group of studies (Adametz, 1959; 
Chow & Randall, 1960; Doty, Beck, & 
Kooi, 1959; Kreindler, Unger, & Volan- 
skii, 1959) indicates that the integrity 
of the midbrain reticular formation is 
not an indispensable precondition for 
the existence of either physiological or 
behavioral arousal or for the occurrence 
of emotional behavior or learning. 
Studies by Sprague, Chambers, and 
Stellar (1961) suggest that affective 
responses to sensory stimuli may be 
mediated by structures other than the 
nonspecific reticular system. Detailed 
analyses of brain-stem organization in- 
dicate that the so-called activation effects 
of reticular stimulation are not the re- 
sult of a unitary arousal system but are 
mediated by the effects of such stimula- 
tion upon a variety of diverse physiologi- 
cal mechanisms (Dell, 1958; Fuster, 
1961). On the basis of a comprehensive 
review of the anatomy and physiology of 
the brain-stem reticular formation, Rossi 
and Zanchetti (1957) concluded that 
this structure was unlikely to act indis- 
criminately as a sheer energizer. 

Bindra (1959), noting that the vari- 
ous indices of arousal do not correlate 
very satisfactorily, has suggested that it 
may be premature to assume that these 
indices are representative of some uni- 
tary arousal process. Despite this caveat, 
explanations of emotional behavior in 
arousal terms continued to multiply. 
Indeed, it was because conflict and 
thwarting situations are customarily 
viewed as emotional situations that the 
concept of arousal played a central role 
in Bindra’s hypothesis of displacement 
activity. As we have seen, such a con- 
ved superfluous for the ex- 


cept has prov T 
planation of displacement activity and 
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recent studies by Schachter and his 
associates suggest that an arousal con- 
struct may be equally superfluous for 
the understanding of other types of emo- 
tional behaviors as well (Schachter & 
inger, 1962). 
ee a has been suggested that 
given the fact that the central nervous 
system is continuously active it may not 
be theoretically necessary to postulate 
an energizing factor (Hinde, 1960). 
Several theorists have found it possible 
to discard general-drive constructs en- 
tirely and present alternative approaches 
to the motivational problems raised by 
learned and species-specific behaviors. 
The essential feature of all these ap- 
proaches is the consideration that the 
occurrence of a specific response in a 
given situation is the outcome of its 
Successful competition with other re- 
sponses which themselves have a certain 
probability of occurrence in that situa- 
tion (Bindra, 1961; Estes, 1958; Hinde, 
1959), The similarity of this formula- 
tion to those proposed by Andrew and 
Rowell is obvious. 

One final implication of the causal 
analysis of displacement activity is worth 
considering. It will be recalled that the 
ethological investigators concentrated 
their attention not only upon the specific 
response under investigation (e.g., dis- 
placement grooming responses) but also 
upon the general behavioral context 
within which these responses occur, Now 
typically, as Bindra (1961) has pointed 
out, psychologists restrict their atten- 
tion to the defined responses under in- 

vestigation (e.g., running, bar pressing, 
key pecking) and ignore the matrix of 
general activity components (e.g., sniff- 
ing, grooming, walking, sitting) within 
which the defined response takes place. 
Bindra has convincingly demonstrated 
that a detailed consideration of such 
background behavior may shed _con- 
siderable light upon the conditions 


H. PHILIP ZEIGLER 


underlying the emergence of the specific 
responses themselves. Accordingly, it is 
to be hoped that in the future such a 
microanalysis of behavior will become a 
more widely-accepted technique in the 


study of both species-specific behavior ` 


and learning. 
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SOCIAL-LEARNING THEORY 
DESIRABILITY, AND THE MMPI 
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Recent factor analytic interpretations of the MMP. i: 

response sets as sources of variance, and the ae i ea 
socially desirable or undesirable ways has been interpreted as the Das 
factor. It is contended that the primary factor is better interpreted k 
degree of psychopathology and both social-learning theory and empirical 
data are presented to support this view. In brief, socially desirable be- 
haviors are typically those which conform to group standards, and the 
deviant behaviors which define psychopathology are usually socially un- 
desirable. Accordingly, the tendency for MMPI scales to load inversely 


to social desirability on the primary fac 
inverse relationship between deviancy an 


N Early in the development of the 
MMPI the test developers recognized 
pe the item responses were susceptible 
to deception and considerable effort went 
into establishing special scales to detect 
(Z and F) and correct for (K )dissimu- 
lation (Meehl & Hathaway, 1946). 
Since that time there have been a great 
many published studies investigating the 
question of how much performance vari- 
ance on the MMPI can be attributed to 
level and type of psychopathology and 
how much to dissimulation. It is surpris- 
Ing that with as much accumulated evi- 
dence as there is, the issue is still a con- 
troversial one, In the opinion of the 
Writer, some clarification may result 
from expanding the psychometric ques- 
tion to include the more basic issue of 
the relationship between socialization 
and psychopathology. The first purpose 
Of this paper is to present 4 review of 
those studies in which the effect of de- 
ception upon MMPI performance has 
been investigated. The second goal is to 
Present an interpretation of the old evi- 
dence from the standpoint of social- 
learning theory and to muster some rela- 
nie È author would like to extend his grati- 

o Leonard D. Goodstein for his helpful 


Criticism nted in this 
Since of the proposals prese: 


tor is a consequent of the actual 
d social acceptability of behavior. 


tively new evidence in behalf of the 
interpretation. 


REVIEW OF THE LITERATURE 


Effect on MMPI Performance of In- 
structed Simulation 

When the MMPI has been adminis- 
tered with instructions to simulate nor- 
mal or psychopathologic performance, 
inconsistent results have been obtained. 
Hunt (1948) found that college stu- 
dents were able to alter their profiles 
when told to fake good or fake bad. 
Wiggins (1959) and Wiggins and Rum- 
rill (1959) report moderate correlations 
between MMPI scales and a scale 
derived by social-desirability role-play- 
instructions to college subjects. 
Cofer, Chance, and Judson (1949) re- 
port that pattern differences were ob- 
tained, relative to standard instructions, 
when college subjects were asked to 
simulate normal or abnormal perform- 
ance. Greater shifts followed from the 
abnormal test-taking set. When psy- 
chiatric patients were given the MMPI 
under instructions to simulate normalcy, 
Grayson and Olinger (1957) found a 
general tendency to change their profile 


patterns in the appropriate direction. 
However, wide individual differences 
between patients in the ability to simu- 

and 


ing 
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late normalcy were observed and only 
11% obtained normal simulated pro- 
files. Gough (1954) requested college 
subjects to respond to the MMPI like 
psychoneurotics, and he found that 
their responses failed to correspond to 
those of neurotics. Even when profes- 
sionally trained clinicians were employed 
as subjects, Gough (1947) reports a 
failure to simulate psychoneurotic and 
psychotic profiles, 

The general conclusion suggested by 
these studies is that MMPI responses 
can be modified under instructed simula- 
tion conditions so that the resultant test 
profiles appear more adjustive and, espe- 
cially, more maladjustive; however, 
specific types of Psychopathology are far 
less susceptible to simulation, 


Special Dissimulation Scales 


Of the three MMPI scales originally 
devised as measures of dissimulation 
(Lie, F, and K), K has received the 


Sreatest research attention. The K scale 
was developed as 


ing defensivenes 
MMPI response. 
pital Patients 


A erivation of K, 
some studies have reported a failure to 


improve MMPI prediction by its use 
(Hunt, 1948; Schmidt, 1948; Tyler & 
Michaelis, 1953). Others have found K 
useful in detecting positive malingering 
(Cofer et al., 1949; Hanley, 1956) anq 
in improving discrimination between nor- 
mal and disturbed college groups (Heil- 
brun, 1963). Smith (1959) challenged 
the interpretation of K as a measure of 
defensiveness by finding a negative 
relationship between K and peer ratings 


of defensiveness in his normal groups. 
He suggested that K should be regarded 
as an index of psychological health, not 
defensiveness, in a normal a 
The positive correlation between K an 
a measure of self-acceptance (Zucker- 
man & Monashkin, 1957) can be inter- 
preted as supporting Smith’s contention. 
Further support for a differential inter- 
pretation of K as a function of es 
population tested was provided by Heil- 
brun (1961) who found decreasing cor- 
relations between K and an independent 
defensive measure with more satisfactory 
adjustment of the college samples stud- 
ied. One clear implication is that the 
denial of psychopathology implicit 1n 
keyed responses on the K scale is less 
likely to represent defensiveness when 
the respondent is within the normal 
range of adjustment; studies employing 
grossly normal subjects might, conse- 
quently, find K to be a less-effective sup- 
pressor variable, 

Several scales have been constructed 
to measure the tendency to endorse 
socially desirable responses on the 
MMPI. Hanley’s Ex scale (1957) 
included items of average popularity and 
neutral in social desirability; zero tO 
moderate correlations with the MMPI 
scales were found, Wiggins (1959) em- 
ployed social desirability role-taking pro- 
cedures to develop his Sd scale, low tO 
moderate relationships to the MMPI 
scales being reported. The SD scale, 
rationally derived by Edwards (1957): 
has been repeatedly found to be highly 
related to scores on the MMPI (Crowne 
& Marlowe, 1960; Edwards, 1961; 
Edwards, Heathers, & Fordyce, 1960; 
Fordyce, 1956; Wiggins, 1959), In addi- 
tion, ratings of social desirability have 
also been shown to be highly related to 
MMPI scale scores (Rosen, 1956; 
Taylor, 1959). 

The studies involving measures of 
social desirability leave little room for 


en 
DŘ 
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doubt that substantial negative covari- 
ance exists between social desirability 
and the MMPI. The greater the prob- 
ability that a subject responds in a 
socially desirable direction on the 
MMPI, the more normal will his test 
profile appear. There remains, however, 
the question of interpretation of this psy- 
chometric relationship which will be 
discussed at some length later in this 


paper. 
Factor-Analytic Studies of the MMPI 


Factor-analytic studies reported be- 
fore 1960 have been consistent in find- 
ing that from one to three (usually two) 
major factors have accounted for most 
of the MMPI scale interrelationships 
whether the samples studied were drawn 
from normal or neuropsychiatric popula- 
tions (Abrams, 1950; Cook & Wherry, 
1950; Cottle, 1950; Kassebaum, Couch, 
& Slater, 1959; Little, 1949; Stout, 
1949; Tyler, 1951; Welsh, 1956; 
Wheeler, Little, & Lehner 1951; Winne, 
1950). Interpretation of factor mean- 
ings has been more variable but even in 
this regard fair agreement exists. Factor 
1 with high loadings on the Sc and Pt 
scales has been labeled “psychotic” 
(Cottle, 1950; Wheeler, 1951), general 
maladjustment (Welsh, 1956), “tend- 
ency to personality maladjustment” 
(Cook & Wherry, 1950), and “ego 
strength versus ego weakness” (Kasse- 
baum et al., 1959). A more inclusive 
Factor 1, interpretable as a general mal- 
adjustment factor, was obtained by 
Stout (1949) and Tyler (1951). A 
second neurotic factor, with high load- 
ings on the neurotic triad (Hs, D, and 
Hy), has also been commonly extracted 
(Cook & Wherry, 1950; Cottle, 1950; 
Tyler, 1951; Wheeler et al., 1951). 

More recent factor-analytic investiga- 
tions of the MMPI have proposed that 
the two primary factors should be inter- 
preted in terms of the test-taking habits 


of the respondent rather than in terms 
of adjustment level. Edwards and 
Heathers (1962) were able to show a 
correlation of —.985 between the SD 
scale and loadings of the MMPI scales 
on the primary factor, and suggested 
that the major source of variance on the 
MMPI is the tendency to give socially 
desirable responses versus the tendency 
to give socially undesirable responses to 
true-false type personality items. Mes- 
sick and Jackson (1961) have proposed 
that the first MMPI factor is better 
interpreted in terms of acquiescence, 
but Edwards and Diers (1962) have 
provided additional evidence that load- 
ings on the first factor can be predicted 
from the proportion of items keyed for 
socially desirable responses on MMPI 
scales or from zero-order correlations 
of the scales with the SD scale. In the 
latest study by Jackson and Messick 
(1962), results were obtained which 
they felt substantiated Edwards’ conten- 
tion that the primary factor is inter- 
pretable in terms of social desirability. 


Implications of the Literature Review 


That performance on the MMPI can 
be intentionally manipulated, given 
simulation instructions, so that the re- 
spondent will appear poorer or better ad- 
justed is reasonably well established, 
although the extent to which dissimula- 
tion affects performance under standard 
instructions is left as a matter for con- 
jecture. ; 

Scales especially devised to detect dis- 
simulation share the common problem of 
distinguishing between socially desirable 
but factual responses of well-adjusted 
subjects and the desirable but false and 
defensive responses of psychopathologic 
subjects. Crowne and Marlowe ( 1960) 
have criticized the Edwards SD scale in 
this regard, since they contend the Kg, 
responses On the SD scale Lad 
socially desirable behavior and psycho 
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pathologic behavior. If such confound- 
ing exists, then the correlations reported 
between dissimulation scales and MMPI 
scales are essentially uninterpretable. 
Covariance between both types of scales 
could be just as well attributed to the 
psychopathology implicit in each as to 
the effects of dissimulation upon 
endorsement of pathology. 
Factor-analytic evidence is not suffi- 
cient to settle the question of the 
MMPI’s vulnerability to dissimulation. 
The level of pathology interpretations 
of the major factor for one array of in- 
vestigators are left to stand against the 
social-desirability interpretation of other 
investigators. The purpose of the next 
section is to consider the alternative in- 
terpretations of the primary MMPI fac- 
tor which have been proposed (i.e, a 
level of adjustment factor versus social 
desirability) » Present some additional 
evidence, and hopefully provide a better 
Tapproachment between personality 
measurement and social-learning theory, 


SocraL-Lrarning THEORY, SOCIAL 
Desras, AND THE MMPI 


Theory of Social Development 
Although it is not 


terize an are primarily 
learned responses. Regardless of which 
learning-theory model one accepts, there 
seems to be general agreement that what 
is learned is contingent in some way 
upon the rewards and punishments 
which accrue as a function of our acts, 
Thus, in his social development the 
child learns to perform certain social be- 
haviors and to avoid others, depending 
in large measure upon the nature of re- 
inforcement imposed by the parents. The 
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parents, in turn, share a common core of 
standards with the larger social group, 
and their reinforcement schedules as well 
as their own behaviors, which serve as a 
model for the child, reflect these com- 
mon standards. These familial learning 
experiences and concurrent experiences 
with peers and other nonfamilial adults 
serve to provide the person with a rela- 
tively stable set of standards of social 
behavior. Some social behaviors are 
“bad” (premarital coitus, unprovoked 
aggression) and their occurrence or even 
the impulse to their occurrence elicits 
anxiety; other social behaviors are 
“good” (friendliness, dependability) and 
social rewards have been and continue 
to be associated with their occurrence. 
Still other social behaviors have been 
neither highly rewarded nor highly pun- 
ished in the child’s social history and are 
best described as neutral. Mussen, Con- 
ger, and Kagan (1963) list deviation 
from cultural expectations as one of the 
major sources of anxiety for the child. 
They state: 


Every culture has an unwritten list of valued 
traits that it expects its members to possess. 
- + + The individual's concept of himself is, to 
a large extent, a function of how closely as 
characteristics approximate the valued traits. 
When the individual perceives a great discrep- 
ancy between his own skill, traits, and tem- 
peramental qualities, and those he feels ae 
should possess, anxiety is generated. The in- 
tensity of the anxiety is related to the degree 
to which the person perceives himself as i 
ating from his own and the culture’s idea 
Standards [p, 147]. 


Individual differences in personal 
standards exist, needless to say, Factors 
such as social-economic class, sex of the 
child, identification model for the child, 
and unique events introduce variable 
patterns of antecedent conditions for 
their development. Yet within a social- 
economic class, like the American middle 
class, there is also a striking commu- 
nality in social standards among its 
members. There is general consensus 3$ 
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to which behaviors are socially desirable 
and which are socially undesirable. 

These broad outlines of social devel- 
opment are generally accorded the name 
socialization, and this aspect of the so- 
cial development process involves learn- 
ing the rules governing the behavior of 
some larger group. Social desirability of 
behavior, as perceived by the individual, 
is a function of the magnitude of posi- 
tive or negative reinforcement imposed 
by the larger group as these have been 
filtered through familial and other learn- 
ing experiences. 
SD Test Response Set 

Cronbach (1946, 1950) defined a test 
response set as a systematic response to 
items independent of item content and 
suggested that such sets contribute prin- 
cipally to error variance, When Ed- 
wards (1959) developed his own Per- 
sonal Preference Schedule, he accorded 
the tendency to endorse socially desira- 
ble characteristics the status of a re- 
sponse set based upon the high correla- 
tion (.87) between the mean social 
desirability attributed to social behav- 
iors by college subjects and the propor- 
tion of such subjects endorsing these 
behaviors as self-characteristic in a 
“yes-no” test format (Edwards, 1953). 
In a more recent statement (Edwards & 
Walker, 1961), “simple psychometric 
considerations” were considered ade- 
quate to account for the SD scale-MMPI 
correlations rather than such “dynamic” 
interpretations as psychoticism versus 
normality. Characteristically, Edwards 
and his colleagues have refrained from 
going beyond the operational definition 
of social-desirability response set pro- 
vided by the SD scale scores in consider- 
ing performance variance on the MMPI. 
Other investigators (Jackson & Messick, 
1958) have taken a middle ground, pre- 
ferring to consider such attributes of 
test performance as acquiescence and 


social-desirability set as “styles” of re- 
sponse which may have more enduring 
and generalized behavioral effects. A 
response style is not necessarily a source 
of testing error, since it may have cri- 
terion relevance. Still others (Crowne 
& Marlowe, 1960; Heilbrun, 1962; Heil- 
brun & Goodstein, 1961; Wiggins, 1959) 
have taken a stronger stand in asserting 
that social-desirability responding on 
tests does have predictive relevance to 
nontest behavior and should not be con- 
sidered a source of testing error. Mar- 
lowe and Crowne and their associates 
have provided rather convincing evi- 
dence that high and low SD test per- 
formers do show differential sensitivity 
to social approval in a wide array of ex- 
perimental tasks (Barthel & Crowne, 
1962; Crowne & Liverant, 1963; Crowne 
& Strickland, 1961; Marlowe & Crowne, 
1961; Strickland & Crowne, 1962). 

A positive relationship must be an- 
ticipated between social desirability of 
responses and actual social behaviors if 
one concurs with the theory of social 
development outlined above. The cor- 
relation of .87 between social desirability 
and endorsement of social behaviors on 
a questionnaire could legitimately be in- 
terpreted as reflecting the strength of 
this positive relationship in middle-class 
college students, The crucial question 
here would seem to be whether bright 
college subjects can and will provide 
veridical self-descriptions when afforded 
the opportunity. If one believes affirma- 
tively, then the high correlation is best 
interpreted as demonstrating the influ- 
ence of the socialization process; other- 
wise, the relationship is best interpreted 
as a reflection of a test-induced defen- 


siveness. 
Social Desirability and Psychopathology 


pathological behaviors 
ably deviant behaviors 
o elicit con- 


Since psychop 
are almost invari 
and social standards serve ti 
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ity from members of a group, it 
tiy that disordered behaviors are 
socially undesirable and normal behav- 
iors are socially desirable. Szasz (1960) 
and Mowrer (1960) have previously 
emphasized the importance of deviation 
from ethical standards in defining mental 
illness. 

Inspection of the 39 MMPI items in- 
cluded in the Edwards SD scale provides 
remarkable support for the proposed re- 
lationship between social desirability 
and psychopathology. Each item is 
keyed for the socially desirable alterna- 
tive and without exception greater social 
desirability appears to be associated with 
endorsement of the more-adjustive be- 
havior option. Crowne and Marlowe 
(1960) report that judges in their study 
also considered the socially undesirable 
options on the SD scale to be indicative 
of maladjustment. Perhaps more con- 
clusive evidence can be extracted from 
two studies by Heilbrun (1960) and 
Goodstein and Heilbrun (1959). Fifteen 
personality variables were rated by 26 


PhD psychologists with regard to their 
adjustive consequences in male and fe- 
male college undergraduates in the 
former study, whereas the latter paper 
presents the personal (in oneself) de- 
sirability of these same behaviors a 
judged by college undergraduates. 
Table 1 gives the rank order of the 1 

variables both with regard to rated ad- 
justment value and personal desirability. 
Social (in others) desirability ranks, 
using Edwards original statement values, 
are also provided for these behaviors. 
The rank-order correlations betwee? 
rated adjustment value of these be- 
haviors and personal desirability is .82: 
the adjustment-social desirability corre- 
lation is .78. Both rho values are highly 
significant (£ = 5.20 and 4.48, respec- 
tively, df = 13, p < .01). These data 


? The personal desirability values assigned tO 
the personality variables represent the average 
values assigned to the nine statements measu- 
ing cach variable. The statement values HA 
obtained by scaling procedures identical 3 
those used by Edwards in estimating the $0 
cial desirability values of the statements. 


TABLE 1 


Raten Apy USTMENT V, 


ALUE, PERSONAL DESIRABILITY, AND SocIAL DESIRABILITY 


OF 15 PERSONALITY VARIABLES 


n Personal Social 
Personality Adjustment desirability desirability 

variable o Value* Rank Tani Raik 

Achievement 2.75 P i $ 
Deference 2.32 

8 11 9 

Order , 2.54 48 s 55 
Exhibition 1.77 1 g 14 
Autonomy 1.70 12 9 12 
Affiliation 2.88 1 2 1 
Intraception 2.33 7 A 8 
Succorance 1.54 13 13 10 
Dominance 2.19 9 3 13 
Abasement 1.25 15 i A 
Nurturance 2.49 6 3 2 

Change 1.96 10 5 4 4 
Endurance 2.54 45 4 5. 
Heterosexuality 2.58 3 7 7 
p 1.30 14 15 15 

Aggression _ Se 


e Values may range between 1.00 (highly maladjustive) and 3.00 (highly adjustive) 
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show a positive and substantial relation- 
ship between what college students con- 
sider desirable behavior and what ex- 
perts consider adjustive behavior. 

It has been argued (Wiener, Blum- 
berg, Segman, & Cooper, 1959) that 
even expert judges in a rating task like 
that used in the Heilbrun study simply 
employ stereotypes of social accepta- 
bility of behavior in making their rat- 
ings rather than the adjustive conse- 
quences of demonstrating these classes 
of behaviors. If so, the correlations of 
82 and .78 would be more parsimoni- 
ously considered coefficients of agree- 
ment between two ratings of desirability. 
Tt should be noted, however, that in the 
Heilbrun study, maladjusted college stu- 
dents were also psychometrically com- 
pared to adjusted students on measures 
of the same 15 personality variables, 
using the Adjective Check List (Gough 
& Heilbrun, in press). The personality 
differences between the adjusted and 
maladjusted groups corresponded with 
what would be predicted from the ad- 
justment ratings of the experts. As an 
illustration, of the 17/30 male and fe- 
male scale comparisons for which signifi- 
cant group differences were obtained, the 
10 traits which were more characteristic 
of the adjusted groups had a mean-rated 
adjustment score of 2.59. The seven 
variables more characteristic of the mal- 
adjusted groups showed a mean score of 
1.51, Since the adjustment scores could 
range only from 1 (maladjustive) to 3 
(adjustive) and there was no overlap 
between the two adjustment-score distri- 
butions, the correspondence between ex- 
bert judgment and empirical finding is 
Striking. These data suggest that the ad- 
justment ratings were in accord with ac- 
tual personality correlates of adjustment 
level rather than responses to desira- 
bility stereotypes. 
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Social Desirability and the MMPI as a 
Measure of Psychopathology 


The clinical scales of the MMPI were 
developed by empirically determining 
differences in item performance between 
clearly pathological groups and normals. 
Accordingly, it has been assumed that 
an appreciable amount of the perform- 
ance variance on the test is attributable 
to degree and type of psychopathology. 
It is appropriate at this point to recon- 
sider the basic question of whether the 
major source of variance for the MMPI 
is attributable to psychopathology or to 
social-desirability response tendencies. 

Heilbrun (1963), in a study of an op- 
timal K-weighting system for prediction 
of maladjustment in a college population, 
determined the biserial correlations be- 
tween each of the 10 MMPI clinical 
scales and membership in normal or psy- 
chologically disturbed groups. Heine- 
man (1952), using college subjects, has 
provided desirability values for the 
MMPI items from which mean desirabil- 
ity values have been determined for the 
same clinical scales. Table 2 presents the 
rank order of these 10 scales from least 
to most socially desirable and two sets 


TABLE 2 


Soctat DESIRABILITY RANK AND DISCRIMINA- 
Tory Power or THE 10 MMPI 
CLINICAL SCALES 


MMPI sD Biserial r” Biserial r| 
Pesto 
scale rank * ry Rank r Rank 
Hs 2 26 6 38 1 
D 4 48 1 31 2 
Hy T 38 4.5 29: 4 
Pd 5 40 3 20 6 
Mf 10 12 10 ar T 
Pa 8 24 75 oD 
Pt 3 38 45 2 5 
Se 1 42 2 30: > 
Ma 9 13.9 04 10 
Si 6 24 75 16 8 


a Least to most. 
è Peychiatric cases Versus normals, 
e Counseling cases versus normals. 
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of biserial correlations ranked in order 
of magnitude. One set represents the 
ordered average correlations for college 
males and females, where the discrimina- 
tion was between normal subjects (W = 
900) and acutely disturbed psychopathic 
hospital patients (V = 100). Averaging 
was performed to be consistent with the 
social-desirability rankings which were 
obtained independent of sex. The sec- 
ond set of correlations represents the 
extent to which the clinical scales dis- 
criminated male college normals (N = 
270) from maladjusted counseling serv- 
ice clients (V = 30). Maladjusted fe- 
males were not included because the 
MMPI proved to be nonpredictive for 
this type of subject. The rank-order 
correlation between social desirability 
and the discriminatory power of the 
clinical scales was .75 for the acutely 
disturbed versus normal subjects and .77 
for the maladjusted male versus normal 
subjects. Both rho values were signifi- 
cant (=3.21 and 3.41, df=8, p< 
02, 01). The more socially undesirable 
the keyed responses on a scale of the 
MMPI, the more successfully that scale 
distinguished the self-descriptions of 
Psychologically disturbed and normal 
subjects. Since the pathological groups 
invariably received the higher scale 
scores, it follows that they were endors- 
ing the keyed, socially undesirable, and 
psychopathological behaviors more fre- 
quently. Therefore, the more socially 
undesirable the behaviors endorsed by an 
individual, the more likely he will eyi- 
dence a manifest psychological disturb- 
ance. 

That psychologically disturbed people 
tend to admit to being Psychologically 
disturbed by their responses to the 
MMPI is an empirically demonstrated 
fact. That their admitted psychologi- 
cally disturbed behavior is socially 
undesirable behavior is also beyond ques- 
tion. There remains, however, one miss- 


ing link before this chain of reasoning 
will reach back to social-learning theory. 
Need the behavioral self-descriptions ob- 
tained on the MMPI bear any relation- 
ship to the same behaviors as they occur 
or have occurred in the natural ger 
test) environment of the individual? T 
a person answers “false” to the item, i 

loved my mother,” is it any more likely 
that he failed to love his mother than the 
person who answers “true”? Hathaway 
and McKinley (1956) avoided this qie 
tion by rendering it irrelevant. Bopig 

cal derivation of the items for the 9¢ 
scale required only that schizophrenics 
answer it differently than normals; ra 
this case, schizophrenics answered z 
“false” with greater frequency. THe 
was no need to show or even be inter- 
ested in whether schizophrenics “really 

failed to love their mothers more fre- 
quently than normals. 

The point of view to be presented here 
is that given a “false” response to the 
mother item, two alternative answers e 
the question of test-response-actuā - 
response equivalence are most Lee 
ably entertained, One, those individue s 
who provide a socially undesirabl 
“false” response to the statement, i 
loved my mother,” are in fact more like y 
to have behaved in this socially unde- 
sirable way than “true” responders; 
further, they are more likely to be psy - 
chologically disturbed than “true ie 
sponders. Two, those individuals by 
Provide a socially undesirable “false oe 
sponse to this item are not in anne 
likely to have behaved in this anes 
socially undesirable way than “true” : 
sponders, but they will be character? es 
by other socially undesirable eee 
(such as answering “false” to the mot 
item); they are still more likely t° ia 
psychologically disturbed. Veridicalit 
of response is not, therefore, a necess# is 
assumption since either alternative y 
consonant with the social-learning the 
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presented earlier; namely, that psycho- 
pathology represents a deviation from 
socially desirable standards of behavior 
and its corollary, the greater the psycho- 
pathology, the greater the deviation. 


Conclusions 


The proposition set forth in this paper 
and explicated by theory and evidence is 
that the dimensions of psychological 
health and social desirability are in large 
measure one and the same. If true, then 
many of the research findings bearing 
upon the relationships of social desira- 
bility to psychological test performance 
have lacked adequate interpretation. The 
positive contingency of social behavior 
adjustment value upon the degree to 
which these behaviors elicit social ap- 
proval has been unduly neglected, pav- 
ing the way for misconstruing the rs 
between test-based measures of social 
behavior and social desirability as neces- 
sary indications of test invalidity. 

Tt is important to point out that some 
of the performance variance on person- 
ality tests is quite likely attributable to 
specific test-taking attitudes describable 
as “faking good” and “faking bad.” The 
former attitude should lead to the en- 
dorsement of socially desirable responses 
and the latter to those which are socially 
undesirable. Thus, it becomes important 
to distinguish between the tendency to 
behave in socially desirable ways, which 
must be mirrored in test performance to 
the extent veridical self-appraisal is 
achieved, and the tendency to respond 
to test items without regard to fact but 
only with regard to the social appearance 


which the person wishes to create by his 


test performance. Block (1962) recently 
has also aligned himself behind the con- 
clusions of this paper. After pointing 
out the necessary relationship between 
Psychological health and socially desira- 
ble behavior, he distinguished between 
a façade or superficial brand of socially 
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desirable behavior and socially desirable 
behavior which is a correlate of adjust- 
ment. The former seems to describe the 
“fake good” set on a test and the latter 
the basic product of social learning ex- 
periences. 
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DOWN’S SYNDROME (MONGOLISM): 
THE HERITABLE ASPECTS 


LISSY F. JARVIK, ARTHUR FALEK, ano W. PHILIP PIERSON 
New York State Psychiatric Institute, New York City 
Striking technological advances in the study of human chromosomes have 


revealed an excess of genetic material in the cells of patients with Down's 
syndrome. Both the trisomic and translocation forms of mongolism are 


discussed, as is the assessment of morbidity risks in relatives of a given 
patient. The origin of the chromosomal abnormalities, and the mecha- 

nisms by which they generate the clinical picture of Down’s syndrome, r 
constitute questions for further research. When this etiological enigma is 
resolved optional management of patients and their relatives will depend, 

even more than at present, upon the active collaboration of psychologists 

and psychiatrists with human geneticists. 


Mental deficiency has been a scourge 
of mankind since early recorded history 
(Barr, 1904), Yet it was only recently 
that the mentally subnormal were as- 
signed to diagnostically useful cate- 
gories, and only during the current 
decade has a promising start been made 
toward Progress from nosology to 
etiology. One of the early workers in 
the area of classification was J. Langdon 
Down who gave a concise clinical de- 
Scription of the syndrome now bearing 
his name and Pointed to the well-known 
Superficial similarities between the dis- 


(Down, 1887) appeared at the same time 
as Mendel’s 1866 classic “Experiments 
in Plant-Hybridization,” (Mendel, 
1866; trans, in Peters, 1959), formulat- 
ing the basic concepts of genetics, It 
was this very science of genetics which 
was responsible for our recently acquired 
understanding of certain biological phe- 
nomena underlying the genesis of 
Down’s syndrome. Like that of Mendel’s 
report, unknown to the scientific com- 
munity for 35 years, the circulation of 
Down’s paper was initially limited, 
Thus, Fraser and Mitchell (1876), 10 
years later, stated that they had “en- 
deavoured to obtain the literature re- 
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lating to this form of idiocy, but without 
success [p. 169].” In the excellent de- 
scription of “this form of idiocy” Fraser 
meticulously recounted the symptoms 
observed in a 40-year-old woman, apn 
pending Mitchell’s notes on 62 similar 
cases. Mitchell suggested that these 
cases could be grouped into a single 
category which he designated as Kalmuc 


idiocy because of the physical resem- ` 


blance of the patients to Kalmucs (mem- 
bers of certain Mongolian triber). 
When Fraser read the paper at 4 
meeting of the Medico-Psychological As- 
sociation in 4875, he was roundly 


criticized by the chairman who stated 
that: 


it struck him that the term Kalmuc was a 
very objectionable term. He thought there kian 
nothing marked about the Kalmuc, and mo 
was no reason for nicknaming an idiot a 
calling him a Kalmuc [Fraser & Mitchell, 1876 
p. 161). . 


The controversy concerning nomencla- 
ture has persisted to this day as show! 


by a letter to Lancet in which 19 sig- ; 


natories advocated the abandonment ee 
the tern®\“mongolism” because of i 
“misleading connotations” (Allen et 4 H 
1961). They did not champion a partic- 
ular substitute but hoped “that agree 
ment on a specific phrase will goor 
crystallize if once the term ‘mongolis™ 
has been abandoned.” 
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The century-old argument regarding 
terminology is clearly of minor import 


_ when compared to questions of etiology. 


Yet prior to the dramatic discoveries of 
1959 information concerning causative 
factors had progressed little beyond 
Mitchell’s astute notations of late birth 
order, frequency of preceding mis- 
carriages or prolonged periods of 
sterility, and advanced parental (espe- 
cially maternal) age (Fraser & Mitchell, 
1876; Øster, 1953, 1956; Penrose, 1934; 
Shuttleworth, 1895, 1909). 

Various postulates regarding the 

etiology of Down’s syndrome were dis- 
cussed in detail by Benda in his well- 
known text (1949, 1960). While these 
hypotheses are far too numerous for 
review here (Ingalls, 1947;. Warkany, 
1960), it may be of interest to recall 
Down’s (1887) own comments in this 
connection: 
Has the nurse dosed the child with opium? 
Has the little one met with any accident? 
Has the instrumental interference which ma- 
ternal safety demanded been the cause of what 
seems to the anxious parents a vacant future? 
Can it be that when away from the family 
attendant medicine has been injudiciously pre- 
scribed? Gan, in fact, the strange anomalies 
which the child presents be attributed to the 
numerous causes which maternal solicitude 
Conjures to the imagination, in order to account 
for a condition, for which any cause is sought 
rather than hereditary taint or parental influ- 
ence [p. 211]? 

The general reluctance to consider 
heritable aspects of mongolism was 
abetted by the paucity of pertinent data. 
Since familial aggregation of mongoloid 
Patients tends to be rare, evidence for 
the operation of hereditary factors was 
essentially limited to the significant find- 
ings in twin studies as well as the oc- 
Currence of minor stigmata of the 
Syndrome in the close relatives of pa- 
tients (Fang, 1950; Penrose, 1954a, 
1954p; Turpin & Caratzali, 1933). A 
Comprehensive evaluation of all twin 


ata then available led Allen and 
t 


Kallmann (1955) to conclude that 
genetic factors were of importance in the 
etiology of mongolism. As of 1957 con- 
cordance rates were close to 100% for 
one-egg pairs, while in two-egg pairs the 
morbidity risk for co-twins was less than 
5% (Allen & Kallmann, 1957). The con- 
clusions reached by Kallmann and his 
associates (Allen & Baroff, 1955; Allen 
& Kallmann, 1955; Baroff, 1958), based 
upon the significant difference in con- 
cordance rates, could not be corroborated 
at the time by examination of children 
born to mongoloid patients. Even today 
(1962), a generation after the introduc- 
tion of modern chemotherapy, only 13 
children (4 afflicted) are known to have 
been born to 12 mothers diagnosed as 
Down’s syndrome (Forssman, Lehmann, 
& Thysell, 1961; Forssman & Thysell, 
1957; Hanhart, 1960; Lelong, Borniche, 
Kreisler, & Baudy, 1949; Levan & Hsu, 
1960; Mullins, Estrada, & Gready, 
1960; Rehn & Thomas, 1957; Sawyer, 
1949; Sawyer & Shafter, 1957; Schlaug, 
1957; Stiles & Goodman, 1961; Thuline 
& Priest, 1961). 

Nevertheless, our level of understand- 
ing of Down’s syndrome today differs 
vastly from that of only 3 years ago. 
The dramatic discoveries made in 1959 
can best be understood in the light of 
knowledge then existing concerning 
cellular aspects of human heredity. 
Although it had long been agreed that 
the nucleus of every cell contained the 
basic hereditary material, it was not until 
1956 that methods became adequate 
for cultivating human cells and studying 
their chromosomes. 

By 1959, a sufficient number of 


reports had appeared in the literature 


(Chu & Giles, 1959; Ford & = : 
er & Lajtha ; 
1956; Ford, Jacobs $ jio & Puck, 


ss 1956; 
Tjio & Levan, ify the tenet that 


1958a, 1958b) to justi 
the normal human complement a 
pairs or 46 chromosomes. Among 
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are two sex chromosomes (one pair) 
which distinguish between the sexes. In 
the female these two chromosomes are 
the same (XX) while in the male they 
are different (XY). In addition to the 
one pair of sex chromosomes there are 
22 other pairs, apparently the same for 
both sexes, which are known as auto- 
somes. The 22 autosome pairs together 
with the sex chromosomes comprise the 
normal diploid complement of 46 chro- 
mosomes.' The autosome pairs have been 
assigned numbers from 1 to 22 according 
to their relative size and configuration 
and the standard nomenclature adopted 
by the original workers in the field is 
known as the Denver system (Denver 


Utilization of the newer techniques of 
cell microscopy 
(cytogenetics) soon led to the Startling 


xtra chromosome in 


€ report 
Gautier, and Turpin 
(1959a) who had analyzed the chromo- 
Somes of three mongoloid boys. The 


Occurrence of the extra small chromo- 
some 


I yndrome affirmed 
the characteristic number of 47, the 


extra small chromosome being desig- 
nated as number 21 (Böök, Fraccaro, 
& Lindsten, 1959; German & Bearn, 


1960; Hanhart, Delhanty, & Penrose, 
1961; Lejeune, Gautier, & Turpin, 
1959b; Makino, Honda, & Mitani, 


1962; Miller, Breg, Schmickel, & 
Tretter, 1961). The cells of such individ. 
uals then contain three, instead of the 

1 Prior to the above noted methodological 


advances the human diploid number was 
believed to be 48. 
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normal two, chromosomes number 21 
and persons with this abnormality 
(trisomy number 21) are referred to 
as 2l-trisomics. 

The origin of the extra chromosome, 
the third one, is, of course, an intriguing 
problem. The two chromosome mem- 
bers of each of the 23 pairs ordinary 
present are contributed by the panne 
germ cells (egg or sperm). As has 4 
been known, each germ cell, or gamete, 
contains not the diploid number of or 
mosomes present in all other cells _ 
only half as many, the haploid gery i 
Thus only one member of each of th 
23 chromosome pairs is represented mM 
the germ cells (23 instead of 46 a 
mosomes). During fertilization the pom 
bination of egg and sperm, with 23 = 
mosomes each, reconstitutes the — o 
diploid number of 46 in generation afte 
generation. ~~ 

The process by which cells oonan 
ing only one half of each chromosom 
pair are formed is called meiosis a 
involves reduction division gece i 
to one half the diploid number of chr 
mosomes). It is beyond the scope 
the present discussion to examine 
detail the intricate biological lana 
that are encompassed by the terms Hie 
spermatogenesis and odgenesis (mito “ 
as well as meiotic divisions, on 
over, formation of polar bodies, a 
but it is sufficient for our purposes k 
examine the final result consisting "i 
eggs and sperm with but one membe” e 
each chromosome pair. When for ae 
reason, as yet unknown, a ee is 
pair does not separate during es 
there arises a germ cell with 24 ee 
of 23 chromosomes (Miller, Mittwor 5 
& Penrose, 1960). The union of this tyP 
of cell with a normal germ cell resu 
in an individual with 47 rather tha? 
chromosomes. this 

In cases of Down’s syndrome pio 
process (nondisjunction) affects © 
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Fic. 1. Meiosis, nondisjunction and translocation. (In normal meiosis, diploid number of 
chromosomes is reduced to haploid number. Fertilization reconstitutes the diploid complement. 
In mongolism resulting from nondisjunction, chromosome pair number 21 fails to separate 
during meiosis and fertilization results in trisomy number 21. In mongolism due to transloca- 
tion, the genetic material of chromosome number 21 is attached to another chromosome—for 
example, number 15. The independent assortment of chromosomes produces four possible com- 


binations: mongol, normal, carrier, lethal.) 


= + =a 


FERTILIZATION 


TRISOMY 


so that the diagnosis of Down’s syn- 
drome can be made through the culture 
and examination of cells from a variety 
of sources (e.g., skin, bone marrow, or 
blood). 

Persistent application of these cyto- 
genetic procedures to typical patients 


mosome pair number 21 (see Figure 1). 
Although nondisjunction is liable to oc- 
cur in both egg and sperm cells, for pur- 
poses of illustration we may consider an 
ovum where the two members of chro- 
mosome pair number 21 remained to- 


gether. When a normal sperm contributes 
its chromosome number 21 to such an soon revealed that some of them had 46 


ovum, the fertilized egg becomes triso- instead of the expected 47 chromosomes. 
mic for chromosome number 21. It is How could the presence of Down’s syn- 
likely that only a small proportion of the drome be reconciled with the absence of 
gametes (germ cells) in a given parent a third chromosome number 21? If the 
are so affected and, unfortunately, we syndrome is attributed to an extra 
still lack the means to detect this process. amount of genetic material from chro- 
It is only after a mongoloid child has mosome number 21, then it is ieenl 
been born that we know nondisjunction to assume that in patients w K 
occurred in a parent’s gametes. In the chromosomes this excess IS attached to 


afflicted child all of the body cells are one of the other chromosomes. ee 
trisomic for chromosome number 21 assumption of a translocation 
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borne out by the finding of excessively 
long chromosomes in such patients. 
The translocation most frequently re- 
ported involves the attachment of 
chromosome number 21 to a chromo- 
some in the 13-14-15 group (so-called 
15/21 translocation) while in other in- 
stances combinations of 22/21 and 21 / 
21 have been observed in translocation 
mongolism (Atkins, O'Sullivan, & Pryles, 
1962; Biesele, Schmid, Lee, & Smith, 
1962; Böök, Santesson, & Zetterqvist, 
1961; Carter & Evans, 1961; Carter, 
Hamerton, Polani, Gunalp, & Weller, 
1960; Ek, Falk, Bergman, & Reitalu, 
1961; Fraccaro, Kaijser, & Lindsten, 
1960; Hamerton, Cowie, Giannelli, 
Briggs, & Polani, 1961; Hamerton & 
Steinberg, 1962; Lehmann & Forssman, 
1962; Macintyre, Staples, Steinberg, & 
Hemple, 1962; Penrose, Ellis, & Del- 
hanty, 1960; Polani, Briggs, Ford, 
Clarke, & Berg, 1960; Scherz, 1962; 
f Turpin & Lejeune, 1961; 
Turpin, Lejeune, Lafourcade, & Gautier, 
s have been de- 


i veral members of 
Succeeding generations, The transloca- 


observed not only 
but also in ap- 
ts and siblings, 
n them reflected 
chromosomal ma- 


terial, unaffected translocation carriers 


having 45, rather than 46, chromosomes, 
The apparently missing chromosome in 
these translocation carriers belonged to 
pair number 21 and was not really 
absent since it was attached to one of the 
other chromosomes. As illustrated dia- 
grammatically in Figure 1, the unaf- 
fected carrier has the usual allotment of 
genetic material, despite the apparent 
shortage of one chromosome. 

Although the appearance (phenotype) 
of translocation carriers gives no clue to 
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their chromosomal abnormality, present 
cytogenetic techniques can eadi 
demonstrate the chromosomal anomaly. 
Persons who inherit the translocation 
chromosome from one of their parents 
show the abnormality in all of their 
body cells. This is a logical cone 
of the fact that all of their cells evolve 
from a single fertilized ovum. age’ 
cally, one half of their germ cells, me 
carry the haploid complement, will u 
tain 23 and the other half only 
chromosomes. While all of the gametes 
with 22 chromosomes lack a Taorphaloi y 
cally discrete chromosome number Ei 
some of them contain the translocation, 
If it is true that absence of the genen 1 
material from chromosome number aa 
is incompatible with reproduction ve 
individual has yet been found with “i 
equivalent of only one chromosome 
number 21), then there remain me 
kinds of germ cells which can yield vi d 
ble offspring, one gamete with 22 ee 
two with 23 chromosomes. Since hs 
gamete with 22 chromosomes conta! r- 
the translocation, fertilization by & min 
mal germ cell will result in a phenotyp 
cally normal individual with 45 chrom 
somes. One of the two gametes a 
23 chromosomes also contains the on 
location, in addition to a en 
chromosome number 21, When suc Fi 
germ cell is fertilized by a nates 
gamete there is an excess of cea 
Some number 21 material which resu 
in a child with Down’s syndrome. no 
the third type of germ cell the eE 
Somes are normal in configuration as d to 
as in number and may be expecte! h 
give rise to a normal individual. Sv ios 
Person does not carry the transloca, J 
and, therefore, cannot transmit Í 
future generations, e 

Since the translocation chromoso™ 
transmitted by unaffected carriers op 
two thirds of viable offspring. in alf 
succceeding generation, and sincè omer 
of those will manifest Down’s synd" 


is 


| 
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why then do we not have an abundance 
of case histories with affected siblings, 
cousins, aunts, uncles, and other relatives 
in collateral lines? After all, investiga- 
tors have been searching for etiological 
factors throughout the past century and 
would surely not have overlooked such 
a family accumulation. Indeed, multiple 
cases have been reported for certain 
families occasionally, and we are now 
able to explain the relative rarity of 
such pedigrees. Obvious familial aggre- 
gation is expected only in the form of 
Down’s syndrome due to translocation 
and the overwhelming proportion ex- 
amined so far have shown trisomy rather 
than translocation. 

As mentioned earlier, the trisomic 
form of mongolism is characterized by 
an extra or third chromosome number 
21 which is indistinguishable from the 
other two chromosomes number 21 and 
the affected individual has a total of 
47 chromosomes. 

Whatever the mechanism, trisomy or 
translocation, there must be postulated 
some relationship between the excess 
chromosomal material and clinical symp- 
tomatology. In fact, most individuals 
containing cells with both the normal 
and deviant chromosome complements 
manifest relatively mild stigmata 
(Clarke, Edwards, & Smallpiece, 1961; 
Fitzgerald & Lycette, 1961; Hayashi, 
Hsu, & Chao, 1962; Ilbery, Lee, & 
Winn, 1961; Lindsten, Alvin, Gustavson, 

Fraccaro, 1962; Nichols, Coriell, 


Fabrizzio, Bishop, & Boggs, 1962; 
Richards & Stewart, 1962). Equating 
c infor- 


chromosomal matter with geneti 
mation, we may say that Down’s syn- 
drome results from an extra amount of 
DNA (deoxyribonucleic acid), the 
macromolecule embodying the genetic 
code, 

It is beyond the scope of the present 

iscussion to enter into the physiochem!- 
cal intricacies of the DNA molecule 
Whose double helix structure was eluci- 


dated by the 1962 Nobel laureates, 
James D. Watson, Francis H. C. Crick, 
and Maurice H. F. Wilkins. The Wat- 
son-Crick model (Watson & Crick, 
1953) has been compared to a ladder 
with the legs twisted about each other. 
The legs of the ladder stand for the 
sugar-phosphate complexes and the 
rungs represent the paired bases (ade- 
nine-thymine and  guanine-cytosine). 
The order in which the base pairs are 
arranged apparently determines the 
genetic code and it is easy to imagine 
how a surplus of coded information (ex- 
cess DNA) could disturb the complex 
copying processes, thus interfering with 
normal development. 

Although the excess genetic material 
from chromosome number 21 is now 
known to culminate in Down’s syndrome, 
the conditions generating the chromo- 
somal aberration remain largely un- 
known. As has been described, trans- 
location chromosomes are transmitted 
according to well-established laws of 
heredity but the forces creating the first 
translocation in any family await defini- 
tion. Similarly, the circumstances con- 
ducive to nondisjunction are yet to be ex- 
plained. Recalling moreover that modern 
methods for the study of human chromo- 
somes have been available for just over 
5 years and that even these are still 
somewhat primitive, it may be expected 
that increased technical sophistication 
will lead to the detection of still further 
abnormalities in Down’s syndrome. It 
cannot be emphasized too strongly that 
the discovery of chromosomal aberra- 
tions is but one step toward the detection 
of causal factors in mongolism. bird 
of the hypotheses proposed durin id 
past century are still tenable provi 

: inking to include the 
we adjust our thinking tigna:In 
origin of chromosomal aberra iologi- 

eformulating the etioiog 
a way we are T amenable to 
cal problem ata level ee Sant thd of 
experimental manipula x 
clinical symptomatology- 
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he factors held of basic im- 
nce bs the past have been ionizing 
radiations, viruses, and senescence. 
Radiation-produced translocations are 
well known in plants and virally induced 
changes have been studied extensively 
in microbial genetics. Though the effects 
of senescence in man are still subject 
to speculation (Jarvik, 1963), observa- 
tions relating an increased frequency of 
mongolism to advanced parental age 
date back to the early descriptions of the 

disorder as mentioned in our introduc- 
tory section. On the basis of cytogenetic 
data it has been suggested that the re- 
lationship between parental age and 
Down’s syndrome applies primarily to 
the trisomic form of mongolism result- 
ing from nondisjunction (Carter & 


Evans, 1961; Hamerton, Briggs, Gia- 
nelli, & Carter, 1961; Penrose, 1961, 
1962), 


As a matter of fact, the association 
with advanced parental age has been 
noted for nondisjunction of chromo- 
Somes other than number 21. In Kline- 
felter’s syndrome, for example, it is two 
X chromosomes which remain joined, 
and together with a Y chromosome give 
rise to abnormal males (XXY instead of 
XY) characterized by gynecomastia, 
hypogonadism, and sterility, The dis- 
order affects about 1 in every 500 new- 
born boys (Bergemann, 1961; 
Harnden, & Court Brown, 1961; Moore, 
1959) or with approximately the fre. 
quency (1 in 650) found in Down’s 
syndrome (Bearn & German, 1961; Pen- 
rose, 1961). 

A predisposition to hondisjunction, 
possibly enhanced by advancing age, has 
been postulated with respect to Kline- 
felter’s as well as Down’s syndrome 
(Lubs, 1961; McKusick, 1962), More- 

over, the observed number of individuals 
with coexistence of both aberrations, 
XXY and trisomy number 21 (with 
a total of 48 chromosomes), has been 
greater than that expected by chance 


Maclean, 
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(Ford, Jones, Miller, Mittwoch, Pen- 
rose, Ridler, & Shapiro, 1959; Harnden, 
Miller, & Penrose, 1960; Lanman, 
Sklarin, Cooper, & Hirschhorn, 1960; 
Lehmann & Forssman, 1960). Chance 
expectation has also been ana? 
the association of mongolism witl 
leukemia and the significance of this 
relationship is being explored (Buckton, 
Harnden, Baikie, Woods, 1961; Davis 
& Korst, 1960; Krivit & Good, 1957: 
Tough, Court Brown, Baikie, piee 
Harnden, Jacobs, King, & McBride, 
1961). Though considerably more data 
will be required before any definite con 
clusions can be reached with regard to 
etiology, the information gathered He 
date (1962) does serve to highlig 
numerous approaches toward pens 
the cryptic processes which lead A 
mongolism. With respect to parental ar : 
effects, for example, it should be profi ; 
able to examine whether nondisjunction 
ensues from changes intrinsic in or a 
trinsic to the ovum and whether P 3 
normal ova are actually responsible A 
the majority of mongoloid individuals z 
claimed on statistical grounds. W on 
preliminary clues may be derived se 
animal experiments, final answers rue 
have to come from human material. ae 
widespread use of oral contraceptive’ 
resulting in the retention of large er 
bers of ova, otherwise discharged r 
monthly intervals, will provide inform a 
tion hitherto unavailable for huma 
females, se 

In addition to defining conditio”® 
favorable or unfavorable for the, e 
velopment of chromosomal aberratio a 
there is a need for delineating the at 
of the redundant genetic material. 
may speculate that the excess 
lends hi an overabundance of substance” 
ordinarily produced by the ca een 
(normal metabolites) and that such P 
fusion is harmful. Familiar examp i- 
of noxious effects correlated with qua” e 
tative increases in metabolites 


a - — 
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diabetes and thyrotoxicosis. Demonstrat- 
ing this type of mechanism in Down’s 
syndrome would encourage a causally 
directed, therapeutic orientation. So 
would the equally plausible suggestion 
that the anomalies seen in Down’s syn- 
drome reflect a deficiency rather than an 
excess of some normally required meta- 
bolites. Furthermore, the similarities 
between mongolism and cretinism, so 
striking that they create diagnostic diffi- 
culties, may also be cited in support of 
this view. That a deficiency hypothesis 
is compatible with replacement therapy 
is illustrated by cretinism where the 
timely administration of thyroid hor- 
mone may not only halt but even re- 
verse the development of the physical 
and mental stigmata of the disorder. 
Conceivably more than one substance 
and more than one mechanism may 
ultimately be implicated in Down’s 
syndrome (Gershoff, Hegsted, & Trul- 
son, 1958; Jérôme, Lejeune, & Turpin, 
1960; King, Gillis, & Baikie, 1962; 
O’Brien & Groshek, 1962). It would 
become mandatory then to distinguish 
for therapeutic purposes between differ- 


ent forms of the disorder, just as it is 
to differentiate for 


necessary now 
counseling purposes trisomy from trans- 
location. 


a of family counseling 


that a working knowledge of modern 
genetics may be said to constitute a sine 
qua none for the psychologist. Today, 
relatives of a mongoloid patient can be 
given a reasonable estimate of the proba- 
bility that they themselves will have an 
afflicted child (ranging from less than 
002% to over 30%), provided the in- 
formed counselor can obtain accurate 
Cytogenetic data. 

Because of the complex evaluation 
required for the correct assessment of the 
facts in a given counseling situation, it 
is imperative that anyone dispensing 
genetic advice be fully familiar with the 
Current genetic literature and qualified 


It is in the are 
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to interpret cytogenetic, biochemical, and 
routine clinicopathological reports 
(Hirsch & Erlenmeyer-Kimling, in 
press). It is hardly necessary to specify 
for psychologists the need for competent 
evaluation of psychological factors. 
Quotation of accurate facts and figures 
without consideration of their impact 
upon the individual seeking advice may 
have devastating consequences. 

Since sophistication in both clinical 
psychology (or psychiatry) and human 
genetics is rarely found in a single indi- 
vidual, several persons, each having a 
portion of the requisite training, may 
profitably work together on a counseling 
problem (Kallmann, 1961; Kallmann & 
Rainer, 1963). With forthcoming knowl- 
edge of the etiology and pathogenesis 
of Down’s syndrome successful thera- 
peutic reforms will no doubt be insti- 
gated so that the heritability of this 
mental dysfunction may eventually be 
identified with curability. 
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NOTE ON “PERSONALITY VARIABLES AND 
RESPONSE TO COLOR” 


IRA M. STEISEL 
St. Christopher's Hospital for Children and Temple University Medical School 


Some criticisms of an article by Cerbus and Nichols (1963) relating to 


their report of author's work. 


In their recent review of the literature 
on the relationship between the response 
to color and variables of personality, 
Cerbus and Nichols (1963) had occa- 
sion to refer to an earlier study of mine 
(Steisel, 1952) which attempted to relate 
some scores of the Rorschach test and 
measures of suggestibility. In their re- 
port there were a number of errors (both 
of omission and commission) which, per- 
haps, warrant correction so that the mis- 
takes will not persist even though the 
overall conclusions of their review might 
be unaffected. 

In the first place, they (Cerbus & 
Nichols, 1963) state that a confederate 
was used in an effort to influence the 
estimates of the amount of movement in 
the autokinetic situation (p. 569). No 
confederate was involved; rather the ex- 
perimenter told the subject the “true” 
distance (actually a prearranged one). 

Secondly, there were additional meas- 
ures of suggestibility used in the original 
study beyond the change of estimate of 
movement as influenced by the experi- 
menter’s statements. Hull’s Body-Sway 
test (1933) and the Ink-Blot Suggestion 
test of Eysenck and Furneux (1945) 
were employed. In addition to these, a 
latency measure and the mean amount of 
movement reported (prior to subjects 
being given a suggestion of “correct” 
distance) were derived from the auto- 
kinetic procedure. 

Despite their comment (Cerbus & 
Nichols, 1963, p. 569), Sum C was not 
utilized; although there were three other 


measures of color responsivity obtained 
from the Rorschach: the absolute and 
the percentage of responses scored as CF 
as well as the average color score which 
had been proposed by Williams (1947). 
Nine additional Rorschach measures 
were reported. 

Perhaps what is most distressing is 
that the two correlations (of a total of 
72 reported) were singled out for dis- 
cussion by Cerbus and Nichols (1963, 
p. 569). It is here contended that se- 
lecting them is a questionable procedure 
in light of the original finding that a 
total of only three correlations were 
greater than zero at the 5% level of 
confidence or beyond. The inescapable 
conclusion from the study, alas, is that 
no relationship exists between the 
Rorschach scores and the measures of 


suggestibility employed. 
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SIGNIFICANCE LEVELS OF INDIVIDUAL 
CORRELATIONS SELECTED FROM 
MULTIVARIATE STUDIES 


GEORGE CERBUS 
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AnD ROBERT C. NICHOLS 
National Merit Scholarship Corporation, Evanston, Illinois 


Reply to criticisms of authors’ report of Steisel’s (1952) work. 


Steisel (1964) has elaborated on our 
reviews scant coverage of his study 
(Steisel, 1952), which was due to the 
necessarily selective nature of a review 
and to our evaluation of his original re- 
port. As Steisel implied, his clarifications 
do not change our interpretation of his 
findings. j 

Steisel’s criticism of our selective re- 
porting of his findings shows a misunder- 
standing of probability levels, which may 
or may not be affected when individual 
correlations are selected from a multi- 
variate study depending on the method 
of selection. When one tests an hypothe- 
sis he is implicitly selecting a single rela- 
tionship for study from an infinite num- 
ber which could be computed. We 
focused on two of Steisel’s correlations 
which were pertinent to the point being 
discussed and which provided partial 
replication of another study. Since these 
results were selected because of the na- 
ture of the variables, not the size of the 
correlations, the number of correlations 


400 


computed between the other (irrelevant) 
variables in the same study does not 
affect the significance level of the selected 
correlations. , 

We would like to second Cattell’s 
(1958) recommendation that authors of 
reviews scan multivariate experiments 
for variables pertinent to the topics ie 
viewed (as was done with Steisel’s 
study). As long as correlations are sè- 
lected because of the nature of the vari- 
ables involved and not the size of the 
relationship, the usual probability levels 
apply. 
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ZERO CORRELATIONS AMONG TESTS OF 
INTELLECTUAL ABILITIES 
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It is often asserted that intercorrelations among tests of intellectual abili- 


ties are 
sup) 
Empirical 


almost always positive, justifying the belief in a “g” factor and 
porting the choice for oblique rotations of axes in factor analyses. 
information is presented from correlation matrices obtained in 


13 typical analyses of intellectual measures showing that among more 
than 7000 correlation coefficients, 17-2490 can be considered to be 0. It 
is estimated that under optimal conditions, when there are 15 factors, as 
few as 11% would need to be O to determine a clear, orthogonal simple 


structure. 


Of a number of statements in the 
psychological literature that achieve the 
status of clichés, or nearly so, the asser- 
tion that practically all tests of intel- 
lectual abilities correlate positively is 
one of the most common. The statement, 
when it appears, can also be taken to 
imply significantly nonzero correlations. 
Tt is commonly used as “proof” that 
there is a universal or g factor on the 
one hand, and as logical support for the 
practice of oblique rotational solutions 
in factor analysis on the other. In dis- 
cussing these points in a previous article 
(Guilford & Zimmerman, 1963), it was 
asserted that “hundreds of them (zero 
correlations) can be exhibited.” It is the 
purpose of this note to support that 
statement by citing some empirical in- 
formation. 

Some years ago, 
could not be so con 


the quoted statement 
fidently made. In the 
earlier efforts directed toward the analy- 
sis of intellectual abilities into their 
unique components, when most of the 
present-day knowledge of intellectual 
factors and sophistication in applica- 
tions of factor analysis were unavailable, 


many of the analyzed tests did inter- 
correlate substantially, or at least posi- 
tively. From what is known now, there 
were ‘essentially two reasons for those 
higher correlations. One reason was that 
the conception of intelligence was a rela- 
tively limited one, consequently, tests 
designed for this area of aptitude were 
less varied and more restricted to cer- 
tain kinds that had much overlapping 
of factorial content. The other main 
reason was that the tests were factorially 
more complex; consequently the chances 
of one test having something in common 
with another in terms of common-factor 
variance were increased. 

On the basis of present-day informa- 
tion, we can recognize the kinds of tests 
that are required to represent factors 
more univocally. It is easier to keep the 
factorial complexity of tests at a lower 
level, although complete success in this 
respect is still far from perfect. It is a 
matter of writing tests in such ways as to 
achieve the necessary experimental con- 
trols that are needed to focus examinees’ 
efforts and methods in the favored di- 
rections. The development of the struc- 
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ture-of-intellect model, with its pointed, 
empirical definitions of intellectual 
abilities, demonstrated or still to be 
demonstrated, has helped considerably 
in guiding test construction in more 
definite directions. The very great 
broadening of the conception of the 
great domain of human intellect has also 
brought into consideration varieties of 
abilities not dreamed of before, thus 
leading to greater varieties of tests. 


A SURVEY or Some CORRELATION 
MATRICES 


the form of Psychological tests. 
Before considerable 
thought is number and 
nature of unique abilities involved 
within a circumscribed area of function- 
ing, deriving hypotheses concerning what 
kinds of abilities will be demonstrated 
by the analysis. Tests are developed 
with an effort to measure each hypothe- 
sized ability and that ability only. Each 
test is designed to measure something 
very different from the basic abilities 
measured by tests designed for Other 
basic abilities. Consequently, there is a 
frank effort to achieve minimal correla. 
tion between every pair of tests of two 
different factors. If the two factors are 
completely uncorrelated in the popula- 
tion of examinees, the goal of a zero 
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correlation is a realistic one. Any corre- 
lation greater than zero between two fac- 
tors limits the possibility of achieving a 
zero correlation between the pairs of 
tests for those two factors, so long as the 
tests have nonzero reliability. 

Thus, one may say that, under the 
circumstances of the studies represented 
in the data, conditions were about as 
favorable as it has been possible to 
achieve, during the past 15 years. Fur- 
ther gains in sophistication might permit 
a more general improvement in reduc- 
tion of intercorrelations where factors 
are not in common to tests. On the other 
hand, a more random selection of tests 
for factor-analytic batteries could cer- 
tainly be expected to reduce the propor- 
tions of zero or neat-zero correlations. 
The latter conditions would hardly be 
tolerated by those who wish to make 
Progress in isolating new factors, 

Considering all 23 analyses of intel- 
lectual abilities that have been per- 
formed by the Aptitudes Project, 19 of 
which have been reported in the litera- 
ture, 13 were selected for use in the sur- 
vey of this note. Selection was made 
without regard to inspection of the cor- 
relation matrices, It aimed to use the 
more representative analyses, in terms of 
the kinds of test variables employed. For 
example, matrices were rejected where 
more than two variables were nonintel- 
lectual (such as a test for finger speed 
or the variable of sex membership), with 
one exception of selection of one study 
that had three such variables. Where 
one or more such variables were in- 
volved, they were excluded in the tally 
of correlation coefficients, for they pro- 
vided biased numbers of near-zero cor- 
relations. Variables were also excluded 
whose communalities were less than .30, 
since low communality puts a limitation 
on the size of correlation coefficients- 
Such instances were extremely rare in 
the matrices used. In three analyses 
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(Report Number 26), one variable 
(Reading Comprehension) was un- 
doubtedly a factorially complex score, 
which might have increased its chances 
of nonzero correlations. But most of 
the other variables were tests of diver- 
gent-production abilities, which usually 
correlate low, even zero, with tests of the 
traditional types of IQ tests, such as 


Reading Comprehension. 
This kind of selection left the use of 


correlation matrices from Reports from 
the Psychological Laboratory, Numbers 
3, 8, 9, 16, 17, 18, 23, and 26.* There 
was one matrix each from most of these 
reports, there were three from Report 
Number 16, and four from Report Num- 
ber 26. Most of the analyses were based 
upon subjects who were males with gen- 
erally higher-than-average IQ levels, 
near the age of 20, in military training 
that ordinarily led to officer commis- 
sions. In Report Number 26 the sub- 
jects were ninth-grade students in two 
southern California high schools. All 
samples exceeded 200 in number, with 
an N of about 225 being rather typical. 
The coefficients were usually Pearson 
rs, or estimates of Pearson 7’s from 
point biserials or cosine-pi estimates 
from 2X2 contingency tables.? 


Results 


The frequency distribution of all co- 
efficients of correlation from the 13 
matrices is presented in Table 1, with 
proportions of the coefficients within 
each class interval and also cumulative 
proportions. The total range is from 
three r’s that fell below —.2 to a single 
r that was above +.7. If we define a 
zero 7 as being within the range from 
—.1 to +.1, we see that .1357+.0357 
=.1711 of the coefficients are essentially 


1See Michael, Comrey, and F ruchter (1962) 


for a listing of all these reports. 
2I am indebted to Gail O’Connor for ma- 


terial assistance in compiling this statistical 
information. 
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TABLE 1 


SUMMARY STATISTICS ON DISTRIBUTIONS OF 
COEFFICIENTS OF CORRELATION IN 13 
FACTOR ANALYSES OF TESTS OF 
INTELLECTUAL ABILITIES 


Class intervals 


ofr Í Ż cp 
7 to .799 1 0001  .9999 
6 to .699 18 .0025 .9998 
5 to 599 121 0171 9973 
4 to 499 561 .0792 .9802 
.3 to 399 1373 .1939 .9010 
.2 to .299 2005 .2831 .7071 
1to .199 1743 .2461 4240 
0 to 099 959 1354 .1779 
—.1 to —.001 253.0357 0425 
—.2 to —.101 45 0064 .0068 
—.3 to —.201 3 0004 = .0004 
Sum 1602.50 7082 .9999 
Mean .226 


zero in value, or about 17%, or one 
sixth. Below the level of —.1 are 48 
coefficients, or about two thirds of 1% 
of the 7’s, which function like zero cor- 
relations in indicating independence of 
factors and suggesting orthogonal rota- 
tions of axes. The percentage of correla- 
tions below +.1 varies a great deal 
from one analysis to another, the range 
being from 5% to 34%, where the per- 
centage over all samples is approxi- 
mately 18. 

The choice of r’s below +.1 in defin- 
ing a zero coefficient is an arbitrary one. 
Let us consider a rational guide, in the 
form of an estimate of sampling errors 
and statistical inference. Considering 
that the typical size of sample is 225, as 
mentioned earlier, the standard error of 
r, when the population value of 7=.0, is 
067. A departure from zero of two 
standard errors is then .134, not .100. 
The null hypothesis cannot be rejected 
at the .05 level of confidence with ob- 
tained 7’s within the range from —.134 
to +.134, which would include a sub- 
stantially higher proportion than the 
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—.1 and +.1; possibly 
fe as 24%. This estimation 
is admittedly very rough, for some ex- 
perimental samples of examinees con- 
tained far more than 225 and none was 
smaller than 200. 


SOME FURTHER CONSIDERATIONS 


From this rather extensive informa- 
tion of a typical nature, the evidence is 
quite clear that zero correlations are 
quite common among tests of intellectual 
abilities. The general level of such co- 
efficients is indicated by a mean r of 
-226.° The distribution of r’s about that 
central value is nearly symmetrical, with 
a bit of positive skewing. There are a 
number of conditions that should lead 
us to expect the form that such a distri- 
bution should take. 

Let us consider the special case of a 
perfect, orthogonal, simple structure; 
that is, each test loaded on only one 
factor and the factors uncorrelated. 
Thus, if there were m factors and & 
tests per factor, there should be 
mk(k-1)/2 positive correlations and 
k°m(m— 1)/2 zero correlations. To take 
a somewhat typical, realistic, case with 
15 factors and 3 tests per factor, the 
number of positive correlations would 
be 45 and the number of zero correla- 
tions 945; or a total of 990, which 
should equal km(km—1)/2, In this 
special case, about 95% of the r’s would 
be zero, which is far above the 17% or 


3 Averaging the corresponding Fisher z val- 
ues gave a very similar result, 


even the 24% that were found empiri- 
cally. f 

But such extreme or ideal instances 
are never found, nor should they be ex- 
pected; nor are they needed in achieving 
an orthogonal simple-structure solution. 
With graphic rotations of axes, at least, 
so long as there is one zero correlation 
between one pair of tests, each repre- 
senting one of a pair of the factors, a 
clear, orthogonal solution can be 
achieved. This means, of course, that 
the much smaller number of zero cor- 
relations would have to be optimally dis- 
tributed, but it would not be limited to 
any one particular distribution, among 
pairs of tests. The number of such zero 
correlations can be surprisingly small. 
This number is given by the quantity 
m(m—1)/2, which, in the case of 15 
factors would be 105. Out of the total 
of 990 7’s this would be approximately 
11%, which is distinctly below the 
empirical 17-24%. Where the zero 7’ 
are not located according to one of the 
optimal distributions among the pairs, of 
course, there would be departure from a 
complete, orthogonal solution. On the 
other hand, psychologically satisfying 
solutions have been obtained with less 


than 11% of the correlations in the cate- 
gory below +.1. 
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EXPERIMENTER OUTCOME-ORIENTATION AND 
THE RESULTS OF THE PSYCHOLOGICAL 
EXPERIMENT* 


ROBERT ROSENTHAL 
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This review considers evidence bearing on the question of whether an 
E’s orientation (expectations and wishes) toward the results of his re- 


search may partially det 


ermine those results. The evidence is drawn from 


everyday life, medicine, clinical psychology, survey research, and experi- 


mental behavioral research. 


In any science, experimenters have 
some orientation towards the outcome or 
results of their research. Rarely is this 
orientation one of truly dispassionate 
disinterest. Variables are not chosen for 
inclusion in research by using tables of 
random numbers. They are, rather, 
chosen because the experimenter has cer- 
tain expectations about the relationship 
or lack of relationship between the se- 
lected variables and certain other varia- 
bles. A superficial exception to this 
might be seen in so-called heuristic hunts 
for relationships, which are perhaps more 
common to the behavioral sciences. Even 
here, however, the inclusion of variables 
is not on a random basis, and certain 
relationships appear more likely to be 
found than others. 

Experimenters then often, if not al- 
ways, have some sort of expectations 
about how the data will fall. Also often, 
if not always, they care about how these 
data fall, Some outcomes may be ex- 
pected more than others; some outcomes 
may be desired more than others. Our 
purpose here is to discuss the question of 
whether experimenter’s orientation (ex- 
pectations and wishes) can affect the 
data actually obtained in his research. 
We are not so much concerned here with 
the problem of choice of experimental 
aper was facilitated 
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design or procedure and the fact that 
certain designs and procedures may un- 
intentionally be more or less favorable 
to obtaining expected or unexpected 
data. Neither are we concerned with 
the problems of statistical tests of hy- 
potheses and the fact that uniquely most 
powerful statistics may unintentionally 
be employed when the expectation is to 
be able to reject the null hypothesis, 
while less powerful statistics may be 
employed when the expectation is to be 
unable to reject the null hypothesis. 
These are interesting questions but will 
not be considered here. Our usage of 
“results” or “outcome” will be restricted 
to the raw data obtained by experiment- 
ers from their subjects. 

The effects of experimenter’s outcome 
orientation, or bias, were seriously con- 
sidered by Wilson (1952), the physical 
scientist. Wilson felt that positive or 
expected data might too often occur be- 
cause of researchers’ interest in the out- 
come of their experiments. Their ex- 
pectancies about data might determine 
in part the data obtained. This notion 
seems most related to Merton’s (1948) 
concept of self-fulfilling prophecy. One 
prophesies an event and the expecta- 
tion of the event, then changes the 
prophet’s behavior in such a way as to 
make the predicted event more likely. 
Related, too, is Heider’s (1958) con- 
cept of “personal causality” and his dis- 
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cussion of the fulfillment of personal 
expectancies. 


OvuTCOME-ORIENTATION EFFECTS IN 
EVERYDAY LIFE 


The way a man golfs or bowls may be 
determined by his expectancy of his 
performance. Of greater interest here 
is the notion that how one person ex- 
pects another to perform these activities 
can determine in part how he actually 
does perform. In an intensive study of 
a social group of young men, Whyte 
(1943) found that the group, and es- 
pecially its leaders, “knew how well a 
man should bowl.” This “knowledge” or 
expectancy probably did partially deter- 
mine that man’s actual performance. 
Perhaps the morale-building banter of- 
fered the one expected to do well helped 
him to do well, Similarly, the communi- 
cated expectancy to another that he 
would do poorly “shook him up” suffi- 
ciently to interfere with his performance 
(see also Back, 1951). 

Fascinating data collected at the 
Bank Street College of Education sug- 
gest that in the schoolroom as in the 
bowling lanes, expectancies may be 
powerful forces determining others’ be- 
havior. Data described by John 
Niemeyer,? President of the College, 
lend support to the hypothesis that 
lower-class, minority-group children are 
low achievers, at least Partly because of 
their teachers’ expectation that these 
students are really not educable, 


OurcoME-ORIENTATION Errects IN 
CLINICAL PRACTICE 


As highly skilled a clinician as Fromm- 
Reichmann (1950) was impressed by the 
effects of the self-fulfilling prophecy, 
although she did not use that term. She 
spoke rather, as other clinicians have, of 
iatrogenic psychiatric incurabilities, The 
therapist’s expectancy, she felt, might 
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determine whether given symptoms 
might be relieved or cured. This clinical 
impression is somewhat supported by the 
work of Heine and Trosman (1960) who 
felt that the variable significant for a 
patient’s continuance in psychotherapy 
was that of mutuality of expectation be- 
tween therapist and patient. Goldstein 
(1960) found no client-perceived person- 
ality change due to psychotherapy re- 
lated to therapist’s expectancy of such 
change. However, therapist’s expectancy 
was related to duration of psycho- 
therapy. Additionally, Heller and Gold- 
stein (1961) found therapist’s expecta- 
tion of client improvement significantly 
correlated (.62) with change in client’s 
attraction to therapist. These workers 
also found that after 15 sessions, clients 
behavior was no more independent than 
before, but their self-descriptions were 
of more independent behavior. The 
therapists generally were favorable to 
increased independence and tended to ex- 
pect successful cases to show this de- 
crease in dependency. Clients may well 
have learned from their therapists 
that independent-sounding verbaliza- 
tions were desired and thereby served 
to fulfill their therapist’s expectancy. 
The role of expectancy in the psycho- 
therapeutic situation has been most fully 
discussed and reviewed by Goldstein 
(1962), 

But Psychotherapy is not the only 
realm of clinical practice in which ex- 
Pectancy effects may determine out- 
comes. The fatality rates of delirium 
tremens have recently not exceeded 
about 15%. However, from time to 
time new treatments of greatly varying 
sorts are reported to reduce this figure 
almost to zero. Gunne’s (1958) work 
in Sweden summarized by the Quarterly 
Journal of Studies on Alcohol Editorial 
Staff (1959) showed that any change 1” 
therapy led to a drop in mortality rate- 
One interpretation of this finding is that 
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the innovator of the new treatment ex- 
pects a decrease in mortality rate, an 
expectancy which leads to subtle dif- 
ferential patient care over and above the 
specific treatment under investigation. 
A prophesy again may have been self- 
fulfilled. 

In the practice of medicine in general, 
the role of physician expectancy looms 
large. In a very comprehensive paper 
dealing with placebo effects, Shapiro 
(1960) cites the well-known admonition: 
“You should treat as many patients as 
possible with the new drugs while they 
still have the power to heal [p. 114].” 
The wisdom of this statement may have 
its basis in the concept of the physician’s 
faith in the power of the drug. This 
“faith” may have at its core expectancy 
as we are discussing it. The physician’s 
expectancy about the efficacy of a treat- 
ment may be subtly communicated to 
the patient with resulting influence on 
the patient’s psychobiological response. 


OutTcoME-ORIENTATION EFFECTS IN 
SURVEY RESEARCH 


Perhaps the classic work in this area 
was that of Rice (1929). A sample of 
2,000 applicants for charity was inter- 
viewed by a group of 12 skilled inter- 
viewers. Interviewers talked individually 
with their respondents who had been 
assigned in a wholly nonselected manner. 
Respondents ascribed their dependent 
status to factors predictable from a 
knowledge of the interviewers’ outcome 
orientations. Thus, one of the interview- 
ers who was a staunch prohibitionist ob- 
tained three times as many responses 
blaming alcohol as did another inter- 
viewer regarded as a socialist, who in 
turn obtained half again as many re- 
sponses blaming industrial factors as did 
the prohibitionist interviewer. Rice con- 
cluded that the outcome orientation Or 
bias of the interviewer was somehow 
communicated to the respondent who 


then replied as expected. Hyman, Cobb, 
Feldman, Hart, and Stember (1954) 
took vigorous exception to Rice’s inter- 
pretation and preferred to ascribe his 
remarkable results to errors of recording 
or interpretation. The plain fact, of 
course, is that we cannot say whether 
these effects were ones which actually 
changed respondents’ replies or not. 
There is no question in either case that 
the results of the research were affected 
by the investigators’ outcome orienta- 
tion. 

One of the earliest studies deliberately 
creating differential expectancies in inter- 
viewers was that conducted by Harvey 
(1938). Each of six boys was inter- 
viewed by each of five young post- 
graduates. The boys were to report to 
the interviewers on a story they had 
been given to read. Interviewers were 
to use these reports to form impressions 
of the boys’ character. Each interviewer 
was given some contrived information 
about the boys’ reliability, sociability, 
and stability, but told not to regard 
these data in assessing the boys. Stand- 
ardized questions asked of the inter- 
viewers at the conclusion of the study 
suggested that biases of assessment oc- 
curred even without interviewers’ aware- 
ness and despite conscious resistance to 
bias. Harvey felt that the interviewers’ 
bias evoked a certain attitude towards 
the boys which in turn determined the 
behavior to be expected and then the in- 
terpretation given. Note how neatly this 

formulation fits the model put forth by 
Merton. Again, we cannot be sure that 
subjects’ responses were actually altered 
by interviewer expectancies. The pos- 
sibility, however, is too provocative to 
overlook. 

More recent evidence for an expect- 
ancy (outcome orientation) bias comes 
from the work of Hanson and Marks 
(1958). The most thorough discussion 
of this problem for the survey research 
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literature is that by Hyman et al. 
(1954), which also carries an extensive 


bibliography. 


OUTCOME-ORIENTATION EFFECTS IN 
EXPERIMENTAL RESEARCH 


It is well known that a great many 
studies have been conducted to establish 
the validity or invalidity of the Ror- 
schach technique of personality assess- 
ment. A systematic study of 168 of 
these studies was undertaken by Levy 
and Orr (1959) who categorized each 
study on each of the following dimen- 
sions: the academic versus nonacademic 
affiliation of the author, whether the 
study was designed to assess construct 
versus criterion validity, and whether 
the outcome of the study was favorable 
or unfavorable to the hypothesis of Ror- 
schach validity. Results showed that 
academicians were more interested in 
construct validity and that their out- 
comes were relatively more favorable to 
construct validation and less favorable 
to criterion validation. On the basis 
of their findings, these workers called 
for more intensive study of the re- 
searcher himself, “For, intentionally or 
not, he seems to exercise greater control 
over human behavior than is generally 
thought [p. 83].” We cannot safely 
conclude that the findings reported were 
a case of the effect of outcome orienta- 
tion or bias. It might have been that the 
choice of specific hypotheses for test- 
ing, or that the choice of manner of test- 
ing them determined the apparently 
biased outcomes. At the very least, how- 
ever, this study accomplished its task of 
calling attention to potential biasing ef- 
fects of experimenters. 

Perhaps the earliest study which em- 
ployed a straightforward experimental 
task and actually manipulated an out- 
come-orientation variable was that of 
Stanton and Baker (1942). In their 

study, 12 nonsense geometric figures 
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were presented to a group of 200 under- 
graduate subjects. After several days, 
retention of these figures was measured 
by five experienced workers. Epir 
menters were supplied with a key 0 

“correct” responses, some of TT 
were actually correct but some of whic! 

were incorrect. All experimenters were 
explicitly warned to guard against any 
bias associated with their having the 
keys before them and therefore aaa 
ing their subjects to guess correct y- 
Results showed that the experimenter 
obtained outcomes in accordance with 
his expectations. When the item on the 
key was correct, the subject’s response 
was more likely to be correct than when 
the key was incorrect. In a careful repli- 
cation of this study, Lindzey (1951) 
emphasized to his experimenters the im- 
portance of keeping the keys out of the 
subjects’ view. This study failed to con- 
firm the Stanton and Baker findings. The 
85 subjects of Lindzey’s study were 
much more of a volunteer saree 
than were the subjects of the origina 
study. We simply cannot say whether 
this fact might have accounted (in whole 
or in part) for the difference. Another 
replication by Friedman (1942) also 
failed to obtain the significance level 
obtained in the original. Still, significan 
results of this sort, even occurring only 
in one out of three experiments, canno 
be dismissed lightly. Stanton (1942, = 
PP. 16-17) himself presented furthe 
evidence which strengthened his con- 
clusions. He employed a set of nonsens? 
materials, 10 of which had been pre 
sented to subjects and 10 of which a 
not. Experimenters were divided “i 

three groups. One group was correctly 
informed as to which 10 materials pe 
been exposed, another group was inco" 
rectly informed, while the third gow 
was told nothing. The results of ae 
study also indicated that the materia 

which experimenters expected to 
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more often chosen were, in fact, more 
often chosen. 

An experiment analogous to those just 
described was conducted in a psycho- 
physical laboratory by workers (Warner 
& Raible, 1937) who interpreted their 
study within the framework of parapsy- 
chological phenomena. The study in- 
volved the judgment of weights by 
subjects who could not see their experi- 
menter. The latter kept his lips tightly 
closed to prevent unconscious whisper- 
ing (Kennedy, 1938). In half the ex- 
perimental trials, the experimenter knew 
the correct response from a key. Of 
thë 17 subjects, 6 showed a standard 
error of 1.0 or more from a 50-50 distri- 
bution of errors. All 6 of these subjects 
made fewer errors on trials on which 
the experimenter knew which weight was 
lighter or heavier. At least for those 
subjects who were somewhat affected by 
the experimenter’s knowledge of the cor- 
rect response, the authors’ conclusion 


seems justified. As an alternative to the — 


interpretation of these results as ex- 
trasensory perception (ESP) phenom- 
ena, they suggested the possibility of 
some form of auditory cue transmission 
to the subjects. 

Among the most recent studies in the 
area of ESP are those by Schmeidler and 
McConnell (1958). These workers 
found that subjects who believed ESP 
possible (“sheep”) performed better at 
ESP tasks than did subjects who did 
not believe ESP possible (“goats”). 
These workers suggested that an experi- 
menter by his presentation, might affect 
subjects’ self-classification, thereby in- 
creasing or decreasing the likelihood of 
successful ESP performance. Similarly, 
Anderson and White (1958) found that 
teachers’ and students’ attitudes toward 
each other might influence performance 
in classroom ESP experiments. The 
mechanism operating here might also 
have been one of certain teachers’ ex- 


409 


pectancies which were communicated to 
the children whose self-classification as 
sheep or goats might thereby be affected. 
The role of the experimenter in the 
results of ESP research has been dis- 
cussed by Crumbaugh (1959) as a source 
of evidence against the existence of the 
phenomenon. We file no brief here for 
or against ESP, but suggest that if, in 
carefully done experiments, certain types 
of experimenters obtain certain types of 
ESP performances in a predictable 
manner (as suggested by the studies 
cited), that further evidence for the ef- 
fects of experimenter outcome-orienta- 
tion will have been adduced (Rhine, 
1959). 

In a more traditional area of psy- 
chological research—memory—Ebbing- 
haus (1913) called attention to similar 
experimenter effects. In his own re- 
search he noted that his expectancy of 
what data he would obtain affected the 
data he subsequently did obtain. He 
pointed out, furthermore, that the ex- 
perimenter’s knowledge of this expect- 
ancy was not sufficient to control the 
phenomenon. This finding has been un- 
fortunately neglected by many sub- 
sequent researchers in the area. 

Another possible case has been de- 
scribed by Stevens (1961). He discussed 
the controversy between Fechner and 
Plateau over the results of bisection 
experiments to determine the nature of 
the function describing the operating 
characteristics of a sensory system. 
Plateau held that it was a power rather 
than a log function. Delboeuf carried 
out experiments for Plateau, but ob- 
tained data approximating the Fechner- 
ian prediction of a log function. Stevens 
puzzled over these results which may be 
interpreted within the notion of ex- 
perimenter outcome-orientation. Either 
by implicitly expecting the Fechnerian 
outcomes or by attempting to guard 
against an anti-Fechnerian bias, Del- 
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boeuf may have influenced the outcome 
i S. 
i w use appear that Pavlov was 
aware of the possibility that experi- 
menter outcome-orientation might affect 
the results of experiments. In an ex- 
change of letters in Science, Zirkle 
(1958) and Razran (1959) in discussing 
Pavlov’s attitude toward the notion of 
the inheritance of acquired character- 
istics, gave credence to a statement by 
Gruenberg (1929): 


In an informal statement made at the time of 
the Thirteenth International Physiological Con- 
gress, Boston, August, 1929, Pavlov explained 


that in checking up these experiments, it was 
found that the apparent improvement in the 
ab 


ility to learn, on the part of successive gen- 
erations of mice, was really due to an improve- 
ment in the ability to teach, on the part of 
the experimenter ! And so this “proof” of the 
transmission of modifications drops out of the 
Picture, at least for the present [p. 327]. 


Wherry è has told of an experiment in 
which rats were able to discriminate 
colors, but only when the experimenter 
was in the room. Christie’s (1951) inter- 
pretation of some differences between 
Towa and Berkeley rats also suggests the 
Possibility of experimenter effects as- 
sociated with his outcome orientation. 
But perhaps the best-known and most 
instructive case illustrating the effects of 
outcome orientation is that of Clever 
Hans (Pfungst, 1911). By means of 
tapping his hoof, the horse of von Osten 
was able to spell, read, and solve prob- 
lems of arithmetic and musical harmony. 
Unlike the owners of other Performing 
animals, Hans’ owner did not profit from 
his animal’s talent, and permitted any 
serious investigator to test Hans even in 
von Osten’s absence. Pfungst, and his 
colleague Stumpf, undertook to discover 
the secret of Hans’ talents. . 
A series of brilliant and painstaking 
experiments revealed that Hans’ ques- 
tioners cued him unintentionally. A for- 
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ward inclination of the questioner’s head 
served as signal to Hans to begin E 
hoof tapping. A slight upward motion 2 
the questioner’s head or eyebrows seryes 
as signal for Hans to stop his tapping. 
Hans’ amazing talents, then, may be 
viewed as an illustration of the power of 
the self-fulfilling prophecy. Questioners, 
even skeptical ones, expected Hans to 


know the correct answers to their queries.. 


Their expectation was reflected in their 
signal to Hans that they awaited the ces- 
sation of his tapping. This signal brought 
on the expected cessation and Hans was 
correct again. , 

Pfungst aptly summarized the diffi- 
culties in uncovering the nature of Clever 
Hans’ talents by speaking of “looking 
for, in the horse, what should have been 
sought in the man.” 

Turning to a more recent example of 
Possible outcome-orientation effects, we 
will describe an experiment dealing with 
the Freudian defense mechanism of pro- 
jection (Rosenthal, 1956). A total of 
108 subjects was randomly divided into 
three groups each receiving success, 
failure, or neutral experience on a task 
structured as and simulating a standard- 
ized test of intelligence. Before the sub- 
jects’ experimental-treatment condition 
was imposed, they were asked to rate the 
degree of success or failure of persons 
Pictured in photographs. Immediately 
after the experimental manipulation, the 
subjects were asked to rate an equivalent 
Set of photos on their degree of success 
or failure. The dependent variable was 
the magnitude of the difference scores 
from pre- to postratings of the photo- 
graphs. It was hypothesized that the 
success-treatment condition would lead 
to greater subsequent perception of 
other people’s success, while the failure- 
treatment condition would lead tO 
greater subsequent perception of other 
people’s failure as measured by the pre- 
post difference scores, 
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An analysis (which was essentially un- 
necessary to the main purpose of the 
study) was performed which compared 
the mean preratings of the three experi- 
mental-treatment conditions, Preratings 
by subjects in the success-treatment 
group were significantly lower and less 
extreme than the prerating by subjects 
in the other conditions. In terms of 
the hypothesis under test, a lower pre- 
rating by this group would tend to lead 
to significantly different difference scores 
if the postratings were similar for all 
treatment conditions. Without the in- 
vestigator’s awareness, the cards had 
been stacked in favor of obtaining 
results confirming the hypothesis under 
test. It should be emphasized that the 
success and failure groups’ instructions 
had been verbally identical during the 
prerating phase of the experiment. 

The investigator, however, was aware 

for each subject which experimental 
treatment the subject would subse- 
quently be administered. 
The implication is that in some subtle manner, 
perhaps by tone, or manner, or gestures, or 
general atmosphere, the experimenter, although 
formally treating the success and failure groups 
in an identical way, influenced the success Ss 
to make lower initial ratings and thus increase 
the experimenter’s probability of verifying his 
hypothesis [Rosenthal, 1956, p. 44]. 


Reports of the findings of the sort just 
presented are not numerous and virtually 
never published. Nevertheless, their oc- 
currence can be documented.* Allusions 
to the effects of the experimenter’s out- 
come orientation ë in general have been 
made by Edwards (1950); Feldman 
(1956); Foster (1923);  Riecken 
(1962); Cohen, Silverman, Bressler, and 
Shmavonian (1961); and half face- 
tiously by Ammons and Ammons 
(1957): and Rotter.® 


10. Gardebring, 1962; J. Gengerelli, 1956; 
G. Mount, 1956; and G. Rosenwald, 1963; per- 
Sonal communications. 

5A series of experiments specifically de- 
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ROLE OF MUSCLE TENSION IN PERSONALITY 
THEORY 
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The relationship of muscle tension to personality is reviewed through the 
19th century to the present. Although the methods of measurement are 
often only indirectly related to one another certain trends emerge. Skeletal 
muscle tension tends to be high: in the hyperactive person, during emo- 
tional stress, in psychotic and neurotic groups, and particularly in dis- 
orders in which anxiety is the major concomitant. The mechanism of 
symptom specificity, whereby a particular muscle develops symptoms due 
to its constant state of sustained contraction, is also discussed. 


The concept of muscle tension has 
been a very important one in the psycho- 
logical literature. From the 1920s to the 
present it has aroused considerable 
speculation and stimulated a good deal 
of research. It has been related to learn- 
ing (Bills, 1927; Courts, 1942), motiva- 
tion (Bartoshuk, 1955), difficulty of 
mental work (Stroud, 1931), degree of 
arousal (Kennedy & Travis, 1948), re- 
action time (Teichner, 1957), attention 
(Wallerstein, 1954), and thinking (Ja- 
cobson, 1932), to mention a few. The 
body of this paper will be concerned with 
its role in personality theory. 

In order to discuss muscle tension 
meaningfully it becomes necessary to 
clarify what one means by this term. 
Jacobson (1938), for example, refers to 
muscle tension in his book on Progressive 
Relaxation as a somewhat vaguely de- 
fined condition of nervous hypertension 
or nervous hyperexcitability. In some 
Cases tension appears to be synonymous 
with tonus, while other writers attempt 
to separate the two terms and to confine 
tonus to a state of sustained contraction 
due to a continuous barrage of nerve im- 
pulses, Although Freeman (1948) speaks 
of the two as similar, he feels that ten- 
Sion is subject to cerebral influences. 


1 The author wishes to express her thanks to 
: A. Wenger for his comments on the origi- 
nal draft of this paper. 


Underlying the various descriptions of 
muscle tension one finds the idea of 
skeletal muscular contraction. For this 
reason, it appears to be most meaningful 
to define muscle tension in terms of 
muscular contraction rather than arguing 
over the different definitions. In states 
of complete relaxation there would be 
no muscular contraction and, therefore, 
no tension (Broman, 1949). When the 
muscle is stimulated, however, it con- 
tracts and there are simultaneous electri- 
cal, chemical, structural, and thermal 
changes. 

A major difficulty in attempting to 
define muscle tension is that it fre- 
quently depends on what measure is used 
to obtain it. In an extensive article by 
Davis (1942), different measures that 
have been claimed to be indicators of 
muscle tension were reviewed. They 
range from modern electrical recording 
devices to some rather indirect means of 
assessment, It is important to keep 10 
mind the method of measuring tension 1n 
each of the studies in this area. 


INDIVIDUAL DIFFERENCES mn MUSCLE 


‘TENSION 


In the early twentieth century miny, 
writers speculated about the appa a 
relationship between the degree to w! 
an individual expressed himself throug: 


the skeletal muscular system and E 
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i ocial extraversion, energy level, 
S ea excitability. While some of 
the early studies were based upon ex- 
perimentation, a rather large number 
were merely clinical accounts of per- 
sonality types. In addition, estimates of 
muscle tension were crude and fre- 
quently were based only on cursory 
descriptions of individuals. 

There seems to be a rather general 

belief that the person who was constantly 
in a state of heightened tension would 
display this tension through certain ex- 
pressive movements. On the basis of such 
a generalization, personality theorists at- 
tempted to show how a person’s gait, 
posture, and gesture revealed relatively 
enduring personality attributes, inde- 
pendently of any mood or situational 
changes. 

Muscle tension as revealed in voice 
and posture was believed by Dunbar 
(1946) and Deutsch (1947) to repre- 
sent an individual’s means of expressing 
habitual defenses, Particularly in states 
of conflict were defenses and repressed 
emotions reflected in expressive move- 
ments. 

As far back as 1890, William James 
divided individuals into hyperactive and 
hypoactive types. Similar categoriza- 
tions were postulated by Davenport 
(1915), Downey (1923), Allport and 
Vernon (1933), and Wolff (1933). At 
one end of the continuum was the calm, 

phlegmatic individual; while at the other 
extreme was the highly excitable person. 
These types were related by some (Ham- 
mett, 1920; Johnson, 1928) to the fre- 
quency and intensity of muscular con- 
traction under emotional stimulation, 
McKinley and Berkwitz (1928) and 
Duffy (1930), rather than defining per- 
sonality types in terms of amount of ten- 
sion, added the criterion of Variability, 
Whereas the phlegmatic individual was 
felt to exhibit a fairly constant tension 
level, the highly excitable person showed 
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a marked variation in tension level from 
one time to the next. 

As supportive evidence for a person- 
ality theory based on differential activa- 
tion of the skeletal muscular system, 
Duffy (1932a, 1957) and Arnold (1942, 
1943) have attempted to show that indi- 
viduals retain their respective ranks in 
amount of muscle tension displayed from 
one situation to the next. Although 
Duffy (1946) admits the existence of 
intraindividual differences due to vari- 
ance in measurement techniques, nature 
of the task performed, and time at which 
data is gathered, she does not believe 
that such specific differences transcend 
an independent and general factor of 
muscle tension. Davis (1938) also be- 
lieves that there are marked individual 
consistencies in muscular activity. : 

As compared to the more relaxed, in- 
dividuals who are highly responsive in 
terms of their skeletal musculature also 
tend to be generally hyperactive. The 
tendency to respond to noise with a high 
increase in muscle tension was found 
by the present writer (Balshan, 1962) 
to be negatively related to Restraint and 
positively related to General Activity On 
the Guilford-Zimmerman Temperament 
Survey. The tension score was based on 
those muscles with the most significant 
loadings on a general muscle-tension fac- 
tor. Muscle tension was defined by 
means of the electromyogram. There 
was no relationship with any of the other 
Guilford-Zimmerman temperaments. , 

Lundervold (1952) found that in- 
creased activity as shown by the elec- 
tromyograph was correlated with a tend- 
ency to react to increases in noise, 
decreases in room temperature, and 
changes in other stimuli with anger and 
irritation. Similarly, Freeman and Kat- 
zoff (1932) revealed a positive corre- 
lation between grip-pressure scores an 


subjective ratings for annoyance a0 
irritability. 
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The fact that individuals tend to re- 
spond in an idiosyncratic manner by 
means of their skeletal musculature was 
also indicated by Jacobson and Kraft 
(1942). On the basis of action poten- 
tials recorded during a period of read- 
ing, three types of responses became ap- 
parent. Some individuals showed an 
increase in muscle tension, others a de- 
crease, while a third group responded 
with ups and downs throughout the 
entire period. 

A rather unique personality break- 
down in terms of response to stress was 
made by Kempe (1956). He found that 
those persons who tend to respond to 
stress by a rather general increase in 
muscle tension are able to deny emotion 
and to intellectualize quite easily. They 
remain aloof from others, maintaining 
very little regard for social convention. 
Contrasted with such types there are 
those individuals who, during stress, re- 
act by way of the autonomic nervous 
system. Autonomic responders are emo- 
tionally sensitive, prone to worry a great 
deal, and have fears of not being ac- 
cepted by others. 

There also appears to be some rela- 
tionship between muscle tension and so- 
cial adjustment (Lewis, 1937). Duffy 
(1932a) showed that the greater the de- 
gree of pressure applied to a dynamom- 
eter the lower were children rated by 
their nursery-school teachers on their 
degree of environmental adjustment. 
The more tense group was believed to 
be living under conditions of strain or 
effort. 

Boys judged to be most tense were 
found by Wenger (1938) to be emo- 
tional, unstable, irritable, and aggres- 
sive. Using a larger sample of children 
Wenger (1943) obtained positive cor- 
relations between ratings of muscle 
tension and energeticness, frequency of 
emotion, carelessness, restlessness, dis- 
tractibility, and impulsiveness. There 
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were negative correlations between the 
ratings and fatigability, sensitiveness, 
and emotional control. 

Negative results were found by 
Martin (1958a, 1958b) who selected 
students with extreme scores in the fol- 
lowing four groups: introverted neu- 
rotics, extraverted neurotics, introverted 
normals, and extraverted normals. Left 
eyelid activity and muscle-action poten- 
tials of the frontalis and the right fore- 
arm extensors were obtained from each 
of the four groups during rest and verbal 
activity. None of these measures was 
found to be related to the personality 
dimensions. 

Just how much muscle tension varies 
from one situation to the next is still 
somewhat of an empirical question. The 
results of Voas (1952) indicate that the 
amplitude of action potentials from 
seven separate muscle groups vary not 
only for different individuals, but under 
different conditions in the same indi- 
vidual. While Malmo (1957) would 
probably concur with this view, he would 
also tend to state that there are some 
individuals who show higher levels of 
physiological reactivity than others to 
a larger number of different situations. 
What these situations have in common is 
that they all appear stressful to the 
individual. Only to the extent that such 
persons are able to avoid conditions with 
heightened arousal values, do they react 


similarly to other people. 


RESPONSES TO EMOTIONAL STRESS 


Almost all individuals react to emo- 


tional stress with some increase in aton 
in the skeletal motor system. Emotiona. 


A es 
material produces changes 10 the m 
and duration © 


quency amplitude, i 
muscle-action potentials taken — 
forehead (Newman, 1953), as be = 
increasing tension in the neck an 2 
forearm flexors and extensors ( : 


1957). When severe emotional stimuli 
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i uced there were increases in 
ee a and hand movements 
which were closely related to the degree 
of induced excitement (Kellogg, 1932). 

According to Duffy (1932b), the most 
extreme increases in muscle tension 
result from emotional stimuli, partic- 
ularly when an individual is confronted 
with problems of adjustment. Not only 
does he apply increased effort at that 

time, but frequently his excitation level 
becomes too high and coordination of 
responses becomes more difficult. 

Under increased strain Arnold (1942, 
1943) found the more tense individual 
to break down relatively quickly due 
to an inability to utilize specific muscula- 
ture in attending to tasks, thereby re- 
sponding with a rather inefficient means 
of energy expenditure. 

By means of pressure on tambours, 
Luria (1932) studied muscular reaction 
in response to distressing verbal stimuli. 
During a state of affect there was a dis- 
organization of response, taking on an 
impulsive, excitatory character, accom- 
panied by a tremor. In response to 
critical words not only did the individual 
apply more pressure to the tambour for 
his right hand, but this tended to over- 
flow to the nonactive hand as well, 

In a similar study, Runkel (1936) 
reported that reactions to words of a 
critical nature were in the form of wave- 
like tremors immediately preceding the 
motor response. The fact that there 
were no appreciable changes in the non- 

active hand may have been due to the 
relatively low affect in the experimental 
situations. 

Sudden increases in muscle tension 
are particularly apparent during the 
discussion of stressful material in psy- 
chiatric interviews. Not only are these 
increases revealed in the patient, but 
they can be found in the examiner > 
well. Whereas speech-muscle tension 0: 
the patient fell rapidly in response to 


praise, it remained high when he was 
criticized. A similar response pattern 
was exhibited by the therapist (Davis & 
Malmo, 1951; Malmo, Boag, & Smith, 
1957). 


PSYCHIATRIC GROUPS 


Under normal conditions, brief and 
temporary states of heightened tension 
commonly occur in response to problem- 
solving situations which require a certain 
degree of attentiveness. Under such con- 
ditions, however, the degree of tension 
necessary to keep the individual alert 1s 
usually not sufficient to call forth his 
total resources or to bring on patho- 
logical symptoms of overstimulation. On 
the other hand, there are certain individ- 
uals who are almost always extremely 
tense. They are no longer beneficially 
stimulated, but instead become hype- 
active and restless. Accompanying the 
marked excitability of the muscular SyS- 
tem are clinical manifestations of a 
creased tendon reflexes, spasticity © 
smooth muscle, tremor, a slightly 1 
regular respiration rate, and a somewhat 
increased pulse rate (Guyton, 1956; 
Jacobson, 1938). F 

Some (Freeman, 1939; Plutchik, 
1954) claim that persistently high 
muscle-tension levels commonly exist 1° 
certain neurotic individuals who are 1? 
constant states of conflict, The conflict 
is thought to produce emotional re- 
actions which are muscular preparations 
for action. 

In an attempt to test the hypothes!s 
that excessive stimulation of the skeletal 
motor system exists in psychological dis- 
orders, Henley (1935) compared not- 
mals with psychiatric groups, bot 
psychotic and neurotic. With all groups 
combined, tension proved to be higher 
for men than women. For normal me? 
it was higher than for patient groups ° 
men, but was higher for female patients 
than normal women. With the introduc 


SS Se -—“‘i‘“ OOO” 


MUSCLE TENSION IN PERSONALITY THEORY 417 


tion of a mental task there was an in- 
crease in the amount of tension in psy- 
chotic and neurotic women, while all of 
the other groups showed a decrease. 
When the psychiatric population was 
further subdivided, tension scores and 
their scatter about the mean were dis- 
tributed in the following order from 
greatest to least: manic-manic, psycho- 
neurotic, dementia praecox, and de- 
pressed-manic. Because of the question- 
able reliability of the measure used to 
define muscle tension (time taken by 
the forearm to cover a defined arc when 
moved by a stated weight and the out- 
moded diagnostic categories), the results 
are somewhat dubious. 

More recently it has been shown that 
psychiatric patients exhibit increases in 
muscle tension under most conditions. 
Although stress usually brings about 
marked increments in tension, it is prob- 
able that to such individuals most situa- 
tions are perceived as being stressful. 
During relatively simple voluntary motor 
tasks higher residual action potentials 
were recorded from forearm flexors and 
extensors of patients than from normal 
individuals (Davidowitz, Brown-May- 
ers, Kohn, Welch, & Hayes, 1955). 

There were distinct differences be- 
tween well-adjusted and emotionally 
unstable children in physiological meas- 
ures taken during rest, attention, learn- 
ing, frustration, recall, and sensory 
stimulation. While some differences 
were found to exist during both rest and 
attention, they were particularly preva- 
lent during frustration. In the emo- 
tionally unstable group an exaggerated 
reaction was found in skin resistance, 
heart rate, blood pressure, respiratory 
irregularity, muscle tension (as meas- 
ured by amount of tremor), and suppres- 
sion of alpha rhythm. Whereas pre- 
psychotic and neurotic individuals 
presented unstable patterns and reacted 
most violently to frustration, both 


psychotic (schizophrenic) and well- 
adjusted persons showed patterns of 
greater stability (Jost, 1941; Sherman 
& Jost, 1942). 

In a comparison of manic-depressives, 
schizophrenics, psychoneurotics, and 
normal adults during motor tasks, in 
contrast to the results of Sherman and 
Jost, it was the schizophrenics who 
showed the greatest increases in muscle 
tension (Wulfeck, 1941). The ap- 
parently contradictory results may be 
due to several factors. To begin with, 
Sherman and Jost only tested three 
schizophrenics. In addition, the only 
measure of muscle tension was the 
amount of tremor present. Moreover, the 
amount of physiological reactivity may 
be related to the chronicity of the 
psychosis. 

As compared to 10 normal individuals 
a group of 21 schizophrenics were found 
by Whatmore and Ellis (1958) to have 
higher muscle-action-potential levels in 
the forehead, jaw, forearm, and leg. A 
similar result was reported by Petursson 
(1962) in a group of 10 patients, most 
of whom were schizophrenics. 

During conditions of pain-stimulation, 
rapid discrimination, and mirror draw- 
ing, psychoneurotics, acute psychotics, 
and chronic schizophrenics, as compared 
to normals, responded by higher muscle 
tension in the neck and right arm. Such 
findings were identical for all three stress 
conditions. While muscle-tension level 
was as high in schizophrenics as in the 
other psychiatric patients, schizophrenics 
were less reactive if the pain stimulus 
was of brief duration (Malmo, Shagass, 
& Smith, 1951). 

Very little work has been eae 
on depressed patients. Whatmore m 
Ellis (1959) took electromyograp Ic 
recordings of the forehead, jaw, see 
and leg in 19 female depressives, T 
~hom exhibited ' xtreme motor retatt & 
tion. When compared wih a Bae 
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arable age, the patient 
ot ee e by higher 
a me levels. The magnitude of 
elevation, however, was greater in the 
more retarded group. When further in- 
vestigations (Whatmore & Ellis, 1962) 
were carried out with five of the retarded 
patients, there was found to be a drop 
in muscle tension during psychiatric 
treatment. On the other hand, both 
before relapse and after treatment the 
muscle readings remained high. Such 
results would lead one to believe that 
high muscle tension is a general charac- 
teristic of the depressed person. 
Impaired motor control and excessive 
muscular tension have been found to 
characterize psychoneurotic disorders, 
especially during stress, Malmo, 
Shagass, and Davis (1951) believe that 
during mirror drawing all individuals 
(normal and Patient populations) re- 
spond with an equal “autonomic pre- 
paratory response.” The difference 
arises, however, in that psychotics and 
Psychoneurotics both have greater 
“skeletal muscle preparatory responses.” 
This suggests a defect in the skeletal 
motor system, resulting in greater 
tensional output (Malmo, 1956; Malmo, 
Shagass, Belanger, & Smith, 1951). 
Similarly, Psychoneurotics were noted 
by Jurko, Jost, and Hill (1952) to over- 
respond to specific Stimulation via the 
skeletal system and to have longer rates 
of recovery from stress, In addition to 
the increased blood pressure and lowered 
skin resistance with which normal 
individuals responded to Stress, the psy- 
choneurotics showed increases in body 
movement and respiration rate as well, 
During agitated states neurotic pa- 
tients evaluated by Jacobson (1952) 
exhibited increases in muscle tension in 
all four regions from which he recorded 
muscle action potentials: brachial biceps, 
quadriceps femoris, eyebrow, and a 
Martin (1956) compared mus 


IRIS BALSHAN GOLDSTEIN 


action potentials in psychotic, neurotic, 
and normal individuals from the fore- 
head and right forearm during various 
stimulus conditions and during rest. 
While there were no differences in the 
resting tension levels of the neurotic 
and normal individuals, a aie 
Was apparent between psychotic om 
normal persons, with psychotics being 
significantly more tense. In ae 
“dysthymics” showed greater amounts o 

tension than a group of hysterics. Among 
the various stress situations, only when 
responding to questions about their 
symptoms did the neurotic group mani- 
fest higher levels of forearm tension 
Differences between this study kae 

those of Malmo may be due to differ- 
ences in populations sampled. s 

In an investigation of muscle tension 
during handwriting in psychoneurotic 
and normal individuals it was found that 
the former group utilized both greater 
grip pressure on a stylus and more varl- 
ability in grip pressure and point 
pressure. Time spent in writing was also 
greater among neurotics, and their re- 
laxation time during writing was = 
Such results support the hypothesis tha 
greater muscular energy is expended by 
neurotics (Ruesch, Cobb, & Finesinget, 
1951; Ruesch & Finesinger, 19434, 
1943b). 

Due to the presence of muscular 
tension in so-called nervous individuals 
there is difficulty in relaxing, Lundervol 
(1951) has shown that individuals who 
are not so tense can relax in many pos 
tions and sit comfortably over long 
periods of time. More tense persons, 0” 
the other hand, are unable to relax 3D 
many positions and can only change wit 
a marked increase in muscle strain- : 

Although the inability to relax is quite 
marked among psychiatric populations 
Jacobson (1934) has found such 
phenomenon also to be characteristic # 
a group of colege students. After 
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period of training the group of neurotic 
individuals could be taught to relax to a 
greater extent than the untrained college 
students (Jacobson, 1938). 


ANXIETY 


Among the psychoneurotic disorders, 
one finds anxiety linked with muscle 
tension in the majority of clinical re- 
ports. Restlessness, strain, tension, and 
overexcitability are only a few of the 
terms commonly associated with anxiety 
states. The anxious person speaks of 
tension in his body and of mental 
tension, Yaskin (1937) defines anxiety 


as: 


a form of affectivity recognized introspectively 
as an unpleasant affect without known cause, 
or with cause inadequately understood or mis- 
taken, and manifested objectively by changes 
in neuromuscular, autonomic visceral and secre- 
tory functions (emotional expressions) [p. 9]. 


According to Jacobson (1942), ten- 
sion in the skeletal muscles is a com- 
ponent, rather than a mere accompani- 
ment of anxiety. The role of muscle 
tension in anxiety is suggested by a de- 
crease in both tremor and anxiety follow- 
ing relaxants (Dickes, Flamen, Cottrera, 
& Tobin, 1955). In addition, Kueth 
and Eriksen (1957) claim that some of 
the effects of anxiety on performance 
may be due to muscle tension which is 
also present. Anxiety states are stated 
by Cameron (1944a) to originate 1n 
those persons who have a tendency to 
develop tension more readily than most 
individuals. 

It is believed by Whitehorn (1944) 
that motor reactions are utilized by 
many neurotic patients to minimize their 
anxiety. Tics, embarrassed laughs, and 
obsessive rituals are forms of motor 
expression which serve as a release for 
the anxious person. 

In a comparison of the responses to 
a pain stimulus in anxiety neurotics, 
schizophrenics, and controls, the degree 


of physiological reactivity was related 
to the amount of anxiety present. Meas- 
ures of motor function (finger-movement 
and neck-muscle-action potentials) were 
found to be related significantly to 
degree of disturbance and to amount of 
anxiety (Malmo, 1956; Malmo & 
Shagass, 1949a). 

Eyeblink rate, an indirect index of 
tension, is felt by Meyer (1953) to be the 
best overall measure of generalized ten- 
sion. Meyer, Bahrick, and Fitts (1953) 
and Lovaas (1960) found that indi- 
viduals who score high on anxiety inven- 
tories also have high blink rates. Using 
muscle-action potentials in the arm as a 
measure of tension, Rossi (1959) also 
found a relationship between manifest 
anxiety scales and muscle tension. 

Although Lewinsohn’s (1956) group 
of anxious patients tended to show a 
greater tremor response to stress than a 
nonanxious control group, the differ- 
ence was not significant. Both the 
anxious patients and a group of ulcer 
patients did, however, have significantly 
greater variance of finger-tremor scores. 
Salivary secretion and skin conductance 
did not appear to distinguish the anxious 
group from the controls. 

The possibility exists that anxious and 
nonanxious individuals may be more 
easily differentiated by after responses 
to a stimulus than by immediate re- 
sponses. After a reaction, it takes the 
anxious person some time before he 
returns to a basal level (Barlow, 1959). 

Further support for such a hypothesis 
comes from a study by Malmo, Shagass, 
and J. F. Davis (1950). The initial 
tension level to a 100-cycle-per-second 


tone of 3 seconds duration, as measured 
by action potentials in the right forearm, 
was found to be equal in a control group 
and in one suffering from severe patho- 
logical anxiety. The control group 
returned in a short time to the previous 
level of tension; the anxious group 
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augmented the response and porma 
it during the entire period of auditory 
stimulation. As a result, the peak of = 
activity for the patients occurred muc 
later than for the controls. . 

When a burst of white noise was ad- 
ministered at 1-minute intervals, there 
was a somewhat larger initial response 
in anxiety patients than a group of con- 
trols (Davis, Malmo, & Shagass, 1954). 
A more significant difference, however, 
was the patients’ higher forearm-extensor 
tension, .3-.4 seconds after stimulation, 
when the response of the contr 
had almost subsided. 

There is some question as to whether 
anxious individuals are always mus- 
cularly tense, or if the elevation in ten- 
sion only occurs in response to a stressful 
situation. In other words, if it were pos- 
sible to get the anxious individual in a 
resting state, would he show any dif- 
ferences from a relatively nonanxious 


person? The present writer (Balshan, 
1962) found that wh 


were divided into hi 
groups on the basi 


‘ol group 


oise, however, 


he group reacted with 
significantly higher increases in muscle 


tension than the Nonanxious one, 
Malmo and Shagass (1949a, 1949p) 
feel that under conditions of Stress there 
is a high probability that reaction 
tendencies of anxious patients will go 
directly into action. Doehring (1957) 
has also found anxiety to be highly cor- 
related with certain tension reactions 
(eyeblink rate) in response to emo. 
tionally stressful words but not to 
relatively neutral words. A high score 
on a manifest anxiety scale is interpreted 
by Mednick (1957) as indicating that a 
person will show certain anxiety re- 
sponses during traumatic situations. 


the high-anxiety 


Anxiety is not felt to be a chronic 
which will manifest itself in a 
cumstance. The fact that anxious indivi . 
uals are sometimes reported | to a 
tain their high muscle-tension oe 
during rest may be due to the P ae 
the subjects were not truly relaxe x 
were reacting to the stress of the exp 
ment itself. 


Symptom SPECIFICITY 


While reaction to stress is frequently 
accompanied by increased activation E 
most of the skeletal musculature, bp SE 
increasing evidence that some me 
uals tend to exhibit high degrees of aid 
sion in specific muscle groups. In > ä 
individuals such a reaction is part 0 O 
life-long mechanism of responding, z 
emotionally-toned events. It is 
quently dependent not only on t 1 
Present situation but on piglet 
Constitution and past experience (Wolff, 
1948), 

After states of sustained muscular nd 
traction in certain muscles, those ante 
cles may become the source of pain an z 
tenderness, outlasting the actual nl 
traction. Wolff (1953) claims that hea a 
aches, backaches, periarticular pain, an 
aches of the extremities will develop 10 
individuals who are characteristically 
tense and manifest inappropriate rea 
iness for action, 

Sustained muscular contraction =. 
ondary to noxious impulses, from disease 
of any structures in the head, or a! 
Sociated with emotional tension iit 
anxiety, have been identified by Wo 
as a potential source of headache an 
Other head sensations. The intensity a 
the pain is Positively correlated wi d 
Muscle-action potentials from the es 
area. In cases of extreme apprehensio” 
the muscles contract as if preparing th 
individual for action; but the action way 
not occur. As a result, there is a persis 
ent feeling of soreness radiating from 
the head and neck. Stimulation of 
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pain nerve endings is felt to be the result, 
not only of sustained contraction, but 
of reduction of blood supply to the mus- 
cles caused by artery constriction and 
interference with blood flow (Simons, 
Day, Goodell, & Wolf, 1943; Wolf & 
Wolff, 1953; Wolff, 1948). 

Those patients having head, neck, or 
arm symptoms were found by Sainsbury 
and Gibson (1954) to have higher action 
potentials in the relevant muscle than 
those without the symptom. It is also sig- 
nificant that the onset of headache was 
accompanied by corresponding increases 
in the activity of the frontalis muscle. 
Those more generalized bodily symp- 
toms, such as tremor and startle, were 
accompanied by increases in muscle 
activity in well-separated areas. 

In an investigation of 400 patients, 
Friedman and Von Storch (1953) found 
what they called tension headache to be 
predominantly an emotionally produced 
physiological reaction to anxiety-provok- 
ing situations. Although relief was 
obtained by placebo in 50% of the cases, 
long-lasting effects were achieved only 
through psychotherapy. 

Other individuals tend to develop 
backaches due to sustained contractions 
in the lumbar sacro spinalis, trapezius, 
and hamstring muscles. Situations which 
either threaten security or impend con- 
flict, guilt, or anxiety have been found 
to produce an increase in electromyo- 
graphic activity in patients with a his- 
tory of backache (Holmes & Wolff, 
1950). 

Similar observations have been made 
by Malmo, Shagass, and Davis (1950b) 
on a group of headache-prone patients. 
Whenever traumatic material was dis- 
cussed in interview sessions, such a pa- 
tient reported the presence of a headache 
and showed a rise in muscle-action 
potentials recorded from the forehead. 

In response to a series of painful 
stimulations elicited by a Hardy-Wolff 
thermal stimulator, patients with head 
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and neck pain again exhibited increases 
in muscle-action potentials from these 
areas. In contrast, those patients with 
clinical histories of cardiovascular com- 
plaints showed increases in heart rate, 
mean heart rate variability, and median 
respiratory variability, rather than dis- 
playing any increases in muscular 
activity (Malmo & Shagass, 1949b; 
Malmo, Shagass, & Davis, 1950a). 

In order to explain specific muscular 
effects Malmo, Shagass, and Davis 
(1950b) speak of “symptom specificity.” 
They believe that a given patient will 
respond maximally with the same physi- 
ological response, almost regardless of 
the stimulus. Accordingly, a particular 
physiological mechanism of somatic 
complaint is specifically susceptible to 
activation by many stressful situations. 

Evidence also has been gathered to 
show that specific bodily changes occur, 
not only in response to general emotional 
disturbances, but to particular emotional 
conflicts as well. Certain psychodynamic 
themes occurring during psychiatric 
interviews appear to be associated with 
increases in muscle tension in rather cir- 
cumscribed areas. Hostile material 
brings about an increase in forearm ten- 
sion, while sexual themes produce a rise 
in leg tension (Shagass & Malmo, 1954). 
It may be seen that certain patterns 
during emotional stress are exhibited by 
various individuals. Why some develop 
backaches or headaches, others more dif- 
fuse patterns, and others still feu a 
physiological patterns is not pO ee 
present. The particular mode Gs ne 
sponse, however, usually appen Eon 

part of a life-long means of E 
One might search for the pre an 

: kness of an orgi? 
factor in the wea ecially 

f iyated becomes esp 
which when activate? 1- suring stress. 
susceptible to stimulation ental factors 
At the same time, environne, for cer- 
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It is particularly interesting that many 
persons with the same manifestations 
during stress have common experiences 
(Wolff, 1953). It seems possible that 
bodily reactions conditioned by earlier 
exposure occur in similar situations en- 
countered at a later date. 

Concerning symptom specificity, a 

word of warning is introduced. Because 
an individual’s tension may become 
focused in a particular area, Cameron 
(1944b) believes that a general increase 
in tension may be overlooked. A certain 
area may have acquired special meaning 
for an individual and is activated dur- 
ing anxiety-provoking situations. While 
for some muscle tension is usually con- 
fined to the problem area, during ex- 
tremely intense anxiety states it may 
extend to other body parts and even to 
other systems. In addition, neuromus- 
cular contractions in particular body 
parts may be so intense that general 
feelings of tension may not be as 
obviously noticed, 


SUMMARY AND CONCLUSIONS 


In scanning the literature it becomes 
obvious that the definitions of muscle 
tension are as numerous as are the 
methods used to measure it. Some of the 
most indirect methods include ratings 
of tension, magnitude of knee jerk, eye- 
blink rate, motor restlessness, and 
tremor. Wendt (1938) notes that in 
most cases the correlation of these meas- 
ures with more direct indices of muscle 
tension has not been investigated. When 
investigations have been made the cor- 
relations are often very low, making it 
very likely that the various methods 
convey different results, _Even where 
the electromyogram is utilized to assess 
tension levels, different muscles haye 
been measured. In many cases record- 
ings are made of only one or two muscle 
groups, which have not always = 
representative of generalized muscle 


tension. 


In spite of the frequent lack of com- 
parability of studies, it is still possible 
to draw some generalizations and to 
perceive the emergence of certain trends. 
To begin with, distinct differences In 
tension level exist between individuals 
and tend to be maintained under many 
situations. There are also suggestions 
that muscle tension relates to a variety of 
traits, although the relationship is not 
always clear-cut. The common factor 
amongst these traits appears to be a 
general tendency to be extremely Te- 
sponsive to most stimuli. Accompanying 
the high muscle tension would be the 
tendency to act out and express emo- 
tion freely (i.e, to be more irritable, 
impulsive, energetic, active, excitable). 

When the situation becomes a partic- 
ularly stressful one, muscle-tension levels 
become elevated in almost all people, 
although certain individuals would still 
be more responsive than others. Among 
those individuals characterized by high 
frequency of elevated motor responses 
are psychiatric populations, particularly 
those with symptoms of anxiety. 

There is very little agreement in the 
literature as to which of the psycho- 
logical disorders is accompanied by the 
greatest amount of muscle disturbance. 
This may be accounted for partially by 
the noncomparability of the experi- 
mental situations. In addition, there 1s 
tremendous variation in the methodology 
of assessing muscle tension, as well as in 
the decision regarding which muscle re- 
Sponses to measure. Another problem 
arises because of the unreliability of the 
diagnostic categories, Finally, the 
chronicity of the psychosis or neurosis 
may have an important bearing on the 
Magnitude of muscle tension. 

In addition to those individuals who 
respond with increases in tension 
throughout their body, some people 
exhibit high degrees of tension in 2 
Specific muscle group. The ideas of both 
specific and general tension are not nec- 
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essarily exclusive; for it is possible for 
the same person to be tense throughout 
his body and yet respond to most situa- 
tions with the greatest amount of tension 
being focused in one particular muscle. 
Sustained muscle contraction in turn 
may lead to pain and symptoms in the 
corresponding muscle; so that the in- 
dividual reports headaches or backaches. 
Exactly why one muscle becomes so sig- 
nificant is due not only to the present 
situation, but to the physiological con- 
stitution of the person and to his past 
experiences as well. 
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therapeutic enterprise. 
ships are then explored: 
patient similarity, 


Psychotherapy has been variously de- 
fined. In this Paper, it will refer to 


understanding, but 
within which ther- 


of the latter, including his Ways of dealing with 


rotion lly toned needs and the situations 
that give rise to them [Shoben, 1953, p. 127]. 
Some writers on the s 
therapy have focuse 
which” aspects, citi 


ubject of Psycho- 
d on the “within 
ng numerous tech- 


niques as being mor 
Others have focused 
choth i 


paper addresses 
the latter assertion, 


covers the period 1946-62, 


CONCEPTIONS oF THE IDEAL 
THERAPEUTIC RELATIONSHIP 


Since a relationship may be charac- 
terized in a myriad of ways, anyone who 
asserts that the relationship itself js im- 
portant in psychotherapy must state 
which sorts of relationship are desirable 
and which not. As Shoben (1949) 
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cisms during the preparation of this paper. 


426 


of the literature on the 
the ideal psychotherapeutic 
f this ideal are shown to 


arious measures of patient progress in the 
Other correlates of 


patient variables, therapist variables, therapist- 
and technique variables, 


good therapeutic relation- 


- In all these areas, signifi- 


pointed out, there is considerable agree- 
ment on this issue. The characteristics 
most frequently cited as desirable are the 

erapist’s warmth, acceptance, permis- 
siveness, respect for the patient, under- 
Standing, interest in the patient, and lik- 
ing for the patient, Rogers (1957, 1959) 
made the additional stipulation that, 
in successful therapy, the patient must 
be able to perceive these therapist quali- 
ties, and he also asserted (Rogers, 1954) 
that the patient must like and respect the 
therapist, 

A number of scales which have been 
constructed (Anderson & Anderson, 
1954; Apfelbaum, 1958; Chase, 1946; 
Fiedler, 1950b; McClelland & Sinaiko, 
1950; Sundland & Barker, 1962) for the 
Purpose of defining the ideal relationship 
indicate that, while the factors cited 
above are the modal conception, agree- 
ment is far from perfect. Thus the issue 
is raised as to what factors are related 
to agreement with the modal conception. 

Chase (1946) derived his scale from 
Statements about counseling procedure 
Which were endorsed by a majority of 
“expert” counselors, Counseling stu- 
dents’ attitudes Senerally did not agree 
with those of the experts and did not 
Correlate either with grades in the coun- 
seling course or with Army General Clas- 
Sification Test Scores, Chase concluded 
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that acquisition of effective counseling 
attitudes was not related to scholastic 
achievement and probably was a func- 
tion of actual experience in the counsel- 
ing situation. 

McClelland and Sinaiko (1950) re- 
fined the Chase scale and administered it 
to undergraduate and graduate psychol- 
ogy students. Comparisons within each 
of the two levels confirmed Chase’s find- 
ing regarding scholastic achievement. 
Between-group comparisons, however, 
showed higher scores for the graduate 
students, and the authors concluded that 
training helped the students to acquire 
their instructors’ attitudes. Of course 
this discrimination may have been more 
a function of self-selection of graduate 
students than of the training itself. Ar- 
buckle (1956) found high agreement 
among graduate students’ descriptions of 
the ideal counselor. In both studies, 
however, it is a moot question whether 
the graduate students actually believed 
in the attitudes they professed or merely 
were more adept at judging what answer 
was expected. 

Fiedler (1950b), using a Q sort tech- 
nique, found for seven therapists repre- 
senting three schools that there were no 
significant differences in conception of 
the ideal therapeutic relationship be- 
tween therapists of different schools, but 
that experienced and inexperienced ther- 
apists of the same school did differ 
significantly from each other. He argued 
that ability to describe the ideal thera- 
peutic relationship was a function of ex- 
perience rather than of theoretical al- 
legiance, Though Fiedler’s evidence has 
often been cited as conclusive, it has not 
gone unchallenged. Apfelbaum (1958) 
criticized Fiedler’s Q deck on the 
grounds that the items reflected extreme 
positions and thus tended to compel 
agreement among sorters, resulting in 
spuriously high intersorter correlations. 

Behar and Altrocci (1961), using a 
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scale constructed by Apfelbaum, asked 
nursing students to describe the ideal 
psychiatric nurse. Participation in psy- 
chiatric nursing courses seemed to pro- 
duce high agreement, whereas actual 
experience with psychiatric patients did 
not. The authors concluded that they 
had refuted Fiedler’s (1950b) hypothe- 
sis concerning experience, and that train- 
ing instead was the critical variable. 
However, they failed to note that the 
high-agreement groups had had consider- 
able experience with nonpsychiatric pa- 
tients. Since it is quite doubtful that 
student nurses are taught to hold sig- 
nificantly different attitudes toward psy- 
chiatric patients as opposed to patients 
in general, the authors’ conclusions are 
questionable. The training and experi- 
ence variables were not properly con- 
trolled in either this study or Fiedler’s 
(1950b); thus the issue remains un- 
resolved as to which contributes more to 
agreement on good therapeutic attitudes. 
Sundland and Barker (1962) con- 
structed a scale which did differentiate 
between therapeutic schools in terms of 
therapeutic attitudes. The criterion for 
item selection was that the items de- 
scribed points of controversy between 
schools: thus all that can be concluded 
is that the authors successfully validated 
their scale. In comparing their results 
to Fiedler’s the authors recognized that 
different attitudes were measured, and 
they properly raised the issue of the ex- 
tent to which differential responses to 2 
scale may be a function of its content 
rather than of the persons responding. 
A related issue concerns the appropriate 
degree of specificity of items in an atti- 
tude scale. While everyone may agree 
that a therapist should be warm and Be 
cepting, there may be considerable re 
agreement as to what these at em 
ally imply. Sundland and Darker . od 
found, contrary to Fiedler’s (19501 ) A 
sults, that experienced and inexperience 
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therapists did not differ in their re- 
sponses. But it must be noted that if a 
therapist is trained in a given school of 
thought, he does not need much experi- 
ence to learn his school’s position on the 
more controversial issues. Thus Fiedler’s 
findings regarding experience and schools 
have still not been conclusively refuted, 
Three studies (Anderson & Anderson, 
1954; Fiedler, 1950b; Thomas, Polan- 


Tespective fields, 


explain the fact that Anderson 
derson’s (1954) patients were successful 
in the task while Behar ang Altrocei’s 
(1961) nonnursing students failed, Per. 
sonal commitment to the significance of 
helping relationships may be a more per- 
tinent variable than experience per se, 
Professional training does not seem to 
be especially relevant. 


and An- 


G. GAIL GARDNER 


RELATIONSHIP Factors AND THERA- 
PEUTIC CHANGE 


We now consider studies in which as- 
pects of the patient-therapist relation- 
ship are related to therapeutic change. 
The studies will be discussed in blocks 
according to the way in which the rela- 
tionship variable is measured. 


Therapist Ratings 


It is difficult to obtain independent 
measures of the two variables under con- 
sideration. When one judges a helping 
relationship to be good in that it satis- 
fies the conditions outlined above, one 
tends to experience a feeling of satisfac- 
tion which in turn acts as a set for per- 
ceiving progress toward therapeutic 
goals. Similarly, the converse is une 
Even when one variable is measure 
alone, it is difficult not to get cues > 
garding the other, Given this inheren 
contamination, it only adds fuel to T 
fire to Propose a design in which the sam 
People rate both variables, The following 
two studies suffer from this error, an 
their results should be interpreted wit 
caution, 

Gorlow, Hoch, and Telschow (1532) 
studied therapy groups and found w 
the therapists had greatest liking for : al 
Most profited group, Since no statisties 

ata were provided, it cannot ba asr 
tained how strong the relationship W n 
between liking and progress. aa 
(1954) found that therapist ratings ° 
liking for their patients correlated c- 
with therapist ratings of therapeutic SY e 
cess (M=23), Retest reliability of t- 
ratings was satisfactory, Since both 13 
ings were made at the end of therapy 
Contamination js very likely. Th eni 
ratings of the degree to which the pane 
used his relationship to the therapist to 
a focus for therapy were not related s 
success. This finding casts doubt 0”, 14 
validity of notions to the contrary 
by many therapists. ore 

Snyder (1961 ) had significantly ™ 
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positive affect toward his more improved 
patients. But the facts that his relation- 
ship and progress measures overlapped 
and that the patients were also his stu- 
dents suggest caution in interpreting 
these data and others from the same re- 
search to be reported below. 

Two studies (Coons, 1957; Lesser, 
1961) utilized projective and other per- 
sonality tests to measure therapeutic 
change and thereby minimized the possi- 
bility of contamination. Coons (1957) 
found that a therapy group which fo- 
cused on interaction showed significantly 
more improvement than either a group 
that stressed insight into psychological 
problems or a no-therapy control group. 
The latter two groups did not differ from 
each other. Coons concluded that inter- 
action is the essential condition for 
therapeutic change. These findings con- 
tradict those of Seeman (1954). The 
difference may be due to the fact that 
Coons’ patients were hospitalized schizo- 
phrenics and Seeman’s were neurotics. 
Since withdrawal from interpersonal in- 
teraction is a major symptom of schizo- 
phrenia, whereas it may or may not be a 
central issue in neurosis, it seems en- 
tirely reasonable that measures of im- 
provement would be more closely related 
to interaction in the former case than in 
the latter. The fact that Seeman’s pa- 
tients were in individual therapy and 
Coons? were in group therapy may also 
contribute to the difference in results. 

Lesser (1961) found no relationship 
between therapists’ ratings of their own 
empathic understanding and differences 
in patients’ self-ideal discrepancies (But- 
ler & Haigh, 1954) before and after 
therapy. It should be noted, however, 
that the number of hours of therapy 
varied from 3 to 12. For most patients 
in treatment for this short a period, even 
the most expert therapists cannot hope 
to effect significant changes in the more 
Permanent personality constructs meas- 
ured by the Butler and Haigh Q sort. 


In all these studies, of course, the 
problems inherent in self-reports are 
relevant. 


Patient Ratings 


Studies in which patients made both 
judgments (Feifel & Eells, 1962; Grigg 
& Goodstein, 1957; Lipkin, 1948) also 
suffer from interdependent measures and 
must therefore be interpreted with cau- 
tion. A patient who feels warmly toward 
his therapist has more reason to judge 
himself improved; conversely, one who 
feels himself improved is more likely to 
have positive feelings toward his thera- 
pist, especially in retrospect. 

Lipkin (1948) found that patients 
who mentioned the counselor’s permis- 
sive attitude in posttherapy evaluations 
usually felt that it helped them to 
achieve insight into their problems. 
Though the length of treatment was no 
greater than that employed by Lesser 
(1961), the improvement criteria were 
much more situational; this difference 
probably explains the discrepant results. 
Grigg and Goodstein (1957), using a 
follow-up questionnaire, found that 
clients who felt a close relationship with 
their counselor reported significantly 
more favorable outcomes than did clients 
who felt a more distant relationship. 
Feifel and Eells (1962) obtained similar 
results. 

Van der Veen (1961) utilized ratings 
by outside judges as the measure of im- 
provement. He found significant posi- 
tive correlations between patient ratings 
of the degree to which therapists pro- 
vided a positive relationship and judges 
process ratings. 

A few studies have relied on test 
as the measure of progress in therapy. 
Lipkin (1954) measured therapy change 
by global judgments of Thematic Apper- 
ception Test protocols. He found that 

successful patients focused less than n 
successful patients on their nelle 
confide in the therapist, and when they 


scores 


G. GAIL GARDNER 
430 


i s, it was in a more positive 
ee 1 patients focused 
on mistrust. Lesser (1961) found no 
relationship between client ratings of 
therapist empathy and changes in self- 
ideal discrepancies. These negative re- 
sults have been explained in the section 
on therapist ratings. Snyder (1961) re- 
ported a significant positive relationship 
between patients’ attitudes toward the 
therapist and their classification as better 

or worse patients. Since, however, the 

measure of the two variables over- 
lapped, the results are contaminated. 


Ratings by Outside Judges 


In studies in which judges rated both 
therapeutic change and the quality of 
the therapeutic relationship, it is not 
always clear whether or not the same 
judges made both ratings. Even if they 
did, these studies have merit over those 
described above in that the patient- 
therapist relationship is now rated by 
people who are not personally involved 
in the outcome of therapy, 

Holt and Luborsky (1952) found that 
resident psychiatrists rated by their su- 
Pervisors as Successful were also rated as 
having better relationships with patients 
than were residents rated as unsuccess- 
ful. Unfortunately, the ratings were 
contaminated by the fact that the super- 
visors themselves ha 
ships with the residents they judged to 
be successful. Luborsky (1952) noted 
that the successful residents were more 
flexible with respect to the Tange of be- 
havior which they felt appropriate for 

their patients. Apparently the less suc- 
cessful residents tended to try to cast 
their patients into a preconceived mold. 
The finding that the successful residents 
had better relationships with ward per- 
sonnel and fellow residents as well as 
with supervisors and patients lends some 
support to Fiedler’s (1950b) hypothesis 
that good therapy relationships are not 


unique entities but rather instances i 
good relationships in general. Pa 
(1956) findings further support this 
notion. : 

Knupfer, Jackson, and Krieger (1959) 
noted the authoritarian nature of the re- 
lationship between the supervisor and 
the beginning therapist, and argued that 
supervisors’ ratings may be heavily 
weighted with evaluations of the thera- 
pists’ attitudes toward authority. Never- 
theless, such ratings are probably more 
valid than the therapists’ own ratings. 

Van der Veen (1961) reported greater 
process-movement scores for clients 
whose therapists were judged to create 
better relationships. Truax (1961a) re- 
ported similar results and properly 
pointed out that causality had not been 
demonstrated. One study (Parloff, 
1961), in which it was made clear that 
the judges of the relationship were dif- 
ferent from the judges of therapeutic 
change, reported significant positive cor- 
relations between the two variables for 
patients in group therapy. 

Two studies (Hiler, 1958; Parloff, 
1961) relied on patient reports and be- 
havior as the measure of change. Parloff 
(1961) noted a positive correlation be- 
tween judged quality of the therapeutic 
relationship and patient reports of relief 
from symptomatic discomfort. In Hiler’s 
(1958) Study, therapists rated by staff 
Psychologists as warm were better able 
to keep unproductive patients in treat- 
ment. 

Truax (1961b, 1962) measured 
change by various test scores, including 
the Minnesota Multiphasic Personality 
Inventory (MMPI), and reported that, 
for two samples of schizophrenic pa- 
tients, judged therapist empathy was 
Positively related to improvement. 

Three studies measured therapy 
change with a combination of ratings 
and test scores. Aronson (1953) Te 
ported no differences in improvement for 


» 


SODS 


THE PSYCHOTHERAPEUTIC RELATIONSHIP 431 


clients of four therapists who were 
judged by their peers to have significantly 
different degrees of ability for warm in- 
terpersonal relationships. The fact that 
both therapists and judges were gradu- 
ate students and that the judgments were 
not limited to therapist-patient relation- 
ships may have contributed to the null 
results. Truax (1961b), in a similarly 
designed study, obtained positive results 
for both neurotic and schizophrenic 
samples. His research differed from 
Aronson’s primarily in that his relation- 
ship judgments were based on actual 
therapy sessions. Stoler’s (1961) judges 
did not concern themselves with the 
therapists, but rather rated the likability 
of the patients on the basis of tape-re- 
corded segments of therapy. Patients 
identified as more likable were more 
often identified as improved. 

In evaluating all these studies, one 
must consider that, when there is little 
agreement as to the nature of improve- 
ment and no conclusive evidence that 
any form of change occurs as a result 
of psychotherapy, it seems premature 
blithely to examine the correlates of 
therapeutic change. Yet there seems to 
be no other appropriate course of action. 

The evidence that the quality of the 
therapeutic relationship is a correlate of 
therapeutic change lies not in the con- 
clusive results of any one study but 
rather in the repeated findings of a series 
of studies, most of which contain one or 
more serious defects. Methodology 
varies greatly, and absence of precise 
definitions often makes it difficult to dis- 
cern whether the “good relationship” of 
one study contains the same elements 
as that of another study or different. In 
a sense, the diversity of procedure 
Strengthens the force of the conclusion. 
Null results do not cluster in any one 
Methodological cell, and, in all types of 
design, positive results occur far more 
frequently. Further support derives from 


the high agreement among different types 
of raters regarding the nature of the re- 
lationship between a given therapist and 
patient. Miller (1949) reported nonsig- 
nificant differences between the judg- 
ments of four raters with widely varying 
amounts of training. Parloff (1961) and 
Snyder (1961) reported correlations of 
.79 and .70, respectively, between pa- 
tients’ and judges’ or therapists’ descrip- 

tions of the patient-therapist relation- 

ship. 


OTHER Factors RELATED TO THE Psy- 
CHOTHERAPEUTIC RELATIONSHIP 


Patient Variables 


Diagnosis. Parloff (1961) found no 
correlation between initial patient eval- 
uations and subsequent measures of the 
quality of the therapeutic relationship. 
Since the patients were all neurotic, the 
range of maladjustment probably was 
not very great. Hollingshead and Red- 
lich (1958) reported an inverse relation- 
ship between patient maladjustment and 
therapist liking for patients ranging over 
the whole gamut of disturbance. How- 
ever, since maladjustment was also in- 
versely related to social class, and since 
liking was positively related to social 
class, the results may be an artifact. 
Snyder (1961) indicated that the pa- 
tients toward whom he felt most friendly 
were relatively more energetic, unin- 
hibited, and suggestible. On the pre- 
therapy MMPI, these patients were more 
hypomanic and obsessive; the least-liked 
patients tended to be more schizoid and 
depressive. 

Prognosis. Since diagnosis and prog- 
nosis are usually themselves related, it 
should follow that prognosis is correlated 
with therapists’ affective attitudes. Strupp 
(1958, 1960) and Wallach and Strupp 
(1960) found this to be true. _ Strupp 
(1960) commented on “the ubiquitous 
but insufficiently realized effects of the 
therapists’ attitudes as they permeate 
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and color his clinical observations and 
judgments [p. 28]? Of Course it must 
be realized that the direction of influence 
may also be the other way. 

Motivation for Therapy. In three 
studies (Raskin, 1961; Strupp, 1960; 
Wallach & Strupp, 1960), therapists had 
greater liking for patients who evidenced 
a desire to change and improve. This is 
hardly surprising since motivation for 
therapy is usually a central consideration 
in making a prognosis, Wallach and 
Strupp inferred from their data that the 
patient’s motivation probably influences 
the therapist’s attitude only when the 
degree of maladjustment is not too great. 

Raskin (1961) noted that liking was 

more effective as a Predictor of high- 

than of low-motivation ratings. Consid- 
ering the Correlational design of his re- 
search, the use of the word “predictor” 
seems inappropriate. Still the finding 
warrants further study. 


Capacity for Friendliness. 
Studies 


Social Class. Hollingsheaq and Red- 
lich (1958) observed that therapists’ at- 
titudes toward their patients were posi- 
tively related to the patients’ social class, 
This was true even when the range of 
maladjustment was curtailed by studying 
only the neurotic sample. The authors 


felt that therapists generally are unable 
to understand lower-class values and 
hence are less prone to like persons hold- 
ing such values. Furthermore, the thera- 
pist’s technical skill is threatened by u 
lower-class patient’s tendency to demanc 
that the therapist behave in an authori- 
tarian manner. 

Expectations Regarding the Therapist. 
Apfelbaum (1958) found that parien 
who expected a high degree of warmth 
from the therapist did not differ on prê- 
therapy MMPI scores from patients Ta 
expected a low degree, and that bar 
these groups had more maladjuste 
MMPI scores than a group which ex- 
pected a moderate degree of warmth. It 
could be hypothesized that therapists 
would have the most positive feelings for 
this middle group. 

That therapist liking results from per- 
ceiving a “good patient” rather than the 
reverse was indicated by Strupp an 
Williams (1960) who found that two 
therapists’ ratings of the following varia- 
bles were all highly intercorrelated: pa- 
tients’ defensiveness, patients’ capacity 
for insight, patients’ motivation for 
therapy, patients? prognosis, and thera- 
pists’ liking for the patients. When 
therapist liking was partialed out, there 
was no change in the magnitude of the 
other coefficients, Two writers (Fiedler, 
1953; Truax, 1961b) reported contra- 
dictory data. In each of these studies, 4 
given therapist’s attitudes were foun 
not to vary across different patients 
Homogeneity of sampling may accoun 
for the null results in both cases. ) 

While it appears that warm relation- 
ships can best be established with “good 
patients who are motivated for therapy» 
only moderately maladjusted, friendly» 
submissive, and who represent the m! F 
dle or upper social classes, persons W A 
do not fit into this category can pomy 
times be treated successfully. In order 
to raise this success rate, it would prob# 


A 
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bly be fruitful to investigate which sorts 
of therapists are most capable of forming 
positive relationships with persons who 
are not good patients. 


Therapist Variables 


Personality Characteristics. Three 
studies (Ashby, Ford, Guerney, & Guer- 
ney, 1957; Brams, 1961; Fiedler & 
Senior, 1952) correlated a large number 
of therapist personality variables with 
measures of the quality of the therapeu- 
tic relationship. Positive results occurred 
no more often than would be expected by 
chance. 

Holt and Luborsky (1952) reported 
that their successful psychiatrists—who 
maintained better patient-therapist rela- 
tionships—were in the middle of the 
range on dimensions of control of affect 
and emotional impulsivity. Unsuccess- 
ful psychiatrists tended to be either im- 
pulsively expressive with their patients 
or overcontrolled with no expression of 
feeling. It may be that other therapist 
qualities have a curvilinear relationship 
with attitudinal variables and that their 
relevance has been refuted because only 
linear relationships were investigated. 

Streitfeld (1959) found that ratings of 
self-acceptance (SA) and acceptance of 
others (AO) were not related to ratings 
of therapeutic competence. He concluded 
that, while general acceptance of others 
is not related to psychotherapeutic 
ability, acceptance of specific patients 
probably is related. The null results 
might also be explained by the fact that 
the therapists were all graduate students. 
Interestingly, therapists who obtained 
low scores on SA and AO rated their 
therapeutic ability similarly to the way 
in which their supervisors rated it, 
whereas this was not true for therapists 
high on SA and AO. The author sug- 
ested that high SA and AO therapists 
may be defensive and immature. A simi- 
lar conclusion may be drawn from data 
reported by Fiedler and Senior (1952). 


Correlational techniques which are sensi- 
tive to curvilinearity may be most ap- 
propriate for studying the self-accept- 
ance variable. 

Familiarity with the Patient. Stoler 
(1961) found that raters who were famil- 
iar with patients’ cases judged the pa- 
tients as more likable than did raters 
reviewing the cases for the first time. 

Ability to Predict Patient Behavior, 
A number of studies (Affleck & Garfield, 
1961; Chance, 1959; Dymond, 1953; 
Fiedler, 1953; Luft, 1950; Melton, 
1952) have found therapists’ predictive 
ability to be quite poor and generally no 
better than that of persons who are not 
therapists. Correlations of this ability, 
such as it is, with relationship factors 
have variously been described as positive 
(Kahn, 1957), negative (Smith, 1960), 
and zero (Mellinger, 1956). Snyder 
(1961) reported positive but nonsignifi- 
cant differences in his ability to predict 
questionnaire responses of his better and 
poorer clients. 

Fiedler (1953) commented that thera- 
peutic understanding refers primarily to 
making the patient feel understood and 
not to any objective diagnostic or pre- 
dictive ability. The evidence is generally 
in support of his implication, namely, 
that predictive ability is not a relevant 
therapist variable. 

Experience. Parloff (1956) reported 
that, for two experienced therapists, 
judges agreed that one created better 
therapeutic relationships than the other. 
Fiedler (1950a, 1951a) found that ex- 
pert therapists were judged better able 
to approximate the ideal relationship 
than were novices. A major weakness in 
the study was that the judges could dis- 
tinguish the experts—who were nation- 
ally known—from the novices on the 
basis of their voices. It is impossible to 
estimate the extent to which halo and 
“pitchfork” effects, respectively, were 
operating. Three other studies (Chance, 
1959; McGowan, 1954; Strupp, 1958), 
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in which this defect was avoided, ob- 
tained similar results. Hollingshead and 
Redlich (1958) found that inexperienced 
therapists tended more to dislike lower- 
class patients than did experienced 
therapists. It may be that the experience 
variable is most relevant in the case of 
patients who do not fit into the “good 
patient” category, 


Therapist-Patient Similarity 


Assumed Similarity (AS). This con- 
struct (Fiedler, 1951b) refers to the de- 
gree to which a therapist believes that 
his own personality is similar to hi 
tient’s personality, 
Forge, and Gilbert 


S pa- 
Hunt, Ewing, La- 


Real Similarity, Lesser (1961) found 


that similarity of self-concept between 
therapist and patient w 


empathy scores, Vogel 


an reflected, his 
own, In this research also, n Dominance 


was the primary factor, Hollingshead 
and Redlich (1958) noted that thera- 
pists had more positive feelings toward 
patients whose social-class backgrounds 
were similar to their own. 

Carson and Heine (1962) found a 
curvilinear relationship between thera- 
pist-patient similarity on the MMPI and 
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therapeutic success ratings. That is, pa- 
tients whose profiles were moderately 
similar to their therapists’ were judged 
most improved. The authors reasoned 
that high similarity prevents the thera- 
pist from maintaining suitable distance 
and objectivity whereas high dissimi- 
larity prevents him from being able to 
empathize with or understand his pa- 
tient’s problems. i 

These studies strongly support Levin- 
son’s (1961) thesis that patient-thera- 
pist similarity cannot be thought of as a 
unitary trait. Some similarities may fa- 
cilitate good relationships and therapeu- 
tic progress, while others may be sources 
of impasses, 


Technique Variables 


Relative Importance of Relationship 
and Technique. Shortly before relation- 
ship variables came into vogue, Seeman 
(1949) studied reactions of clients who 
had been counseled by directive and 
nondirective methods, He found signifi- 
cantly different client reactions for pairs 
of counselors whose techniques were 
similar and nonsignificant differences 
where techniques were different. He con- 
cluded that some factor other than 
therapeutic method was producing dif- 
ferences in client reactions, 

Fiedler (1950a, 1951a, 1953) rushed 
in to fill the vacuum, affirming Seeman’s 
(1949) conclusion and stating that the 
therapeutic relationship is the critical 
variable in successful therapy. Fiedler 
(1953) Commented, however, that 4 
given technique might serve to make @ 
therapist feel more secure and thus 1n- 
directly affect the course of therapy. , 

More recent research (Ford, 1956; 
Forgy & Black, 1954; Snyder, 1957) has 
Placed technique in a somewhat more 
favorable light. These writers have con- 
cluded that relationship and success vary 
not as a function of technique alone, but 
rather as a function of a particular thera- 
pist using a Particular technique. They 
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feel that the therapist and his method 
must be viewed as a single unit. 

Strupp (1958) concurred that both 
relationship and technique factors are 
important in successful therapy. In- 
terestingly, in a later article (Strupp, 
1962), he was not so willing to commit 
himself and stated: 

For some years it has been held that the 
quality of the therapeutic relationship is more 
basic to therapeutic success than the thera- 
pist’s specific methods and techniques. .. . 


It may be predicted that the issue will remain 
alive for some time [p. 452]. 


Covariance of Relationship and Tech- 
nique Factors. Two studies (Aronson, 
1953; Strupp, 1958) reported a negative 
correlation between the degree of 
warmth in therapists’ attitudes toward 
their patients and the extent to which 
they used directive techniques. Apfel- 
baum (1958) studied patients’ pre- 
therapy expectations of their therapists’ 
behavior and reported a curvilinear rela- 
tionship between the two variables. 
Therapists expected to be at the ex- 
tremes of the warmth dimension were 
expected to employ directive techniques; 
therapists expected to exhibit a moderate 
amount of warmth were expected to be 
relatively more nondirective. The dis- 
crepancy between these data and those 
of Strupp (1958) and Aronson (1953) 
may be explained in two ways: either 
therapists do not exist who are as warm 
as some patients expect or such thera- 
pists do exist but were not represented 
in Strupp’s and Aronson’s samples. Of 
course variations in measuring instru- 
ments may also account for the differ- 
ences. 

Throughout this paper, considerable 
weight has been given to methodological 
issues. It is indeed a pity that research 
findings must so often be questioned be- 
cause of methodological defects. Not 
only the nature of the measurements but 
also the point at which they are made 
should be considered, especially in the 
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case of patients’ and therapists’ self-re- 
ports. While time may dim the memory, 
it may also extinguish inhibitions and 
thus allow greater honesty as regards 
negative feelings and evaluations (Feifel 
& Eells, 1962). 
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Increases in activity level have often been used to imply the presence of 
a heightened drive state. Frequently, drive is also anchored on the ante- 


cedent side to a condition of need, 
the relationship between need state a: 
cifically, activity level is discussed 
activity deprivation. It is suggeste 


activity level may be more meanin 


Students of motivation who employ 
the concept of “drive” as the origin or 
l activity of an 
eir experimental 
identify or meas- 
in the organism 

drive. Hence, 
drives are unknowns, postulational or 
under our pres- 
Their spatial 


“drive,” drawn from general 


Sources, has implications be- 
yond mere antecedence—its connotation 


is similar to “forced,” in Loeb’s classic 
definition of But it is the 


stances which operate prior to the marl 
lated drives. The behavior of organisms 
which is usually measured may be pri- 
marily locomotor, or it may consist of 
postural shifts, or be even more vaguely 


1The authors would like to express their 
gratitude to K. C. Pratt and N. R. Ellis for 
their helpful suggestions in the preparation of 
this manuscript. 
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Research literature which bears on 
nd activity level is reviewed. Spe- 
in terms of food, water, sex, and 
d that associative interpretations of 
giul than drive interpretations. 


described as “restlessness,” or termed 
“general,” “random,” or “spontaneous 
activity. y 

The precursor or antecedent of onma 
has been designated by some as a nee 
state. For the purposes of this discus- 
sion, “need state” may be defined as 
any condition of deficit or excess which 
causes the organism to depart from the 
dynamic equilibrium known as homeo- 
stasis, 

Research upon the relationships be- 
tween need, drive, and activity encoun- 
ters many and varied problems which 
do not admit of any easy or entirely 
Satisfactory solutions. Among these the 
following may be cited as fundamental: 


1. The determination of the base line 
of activity since the organisms studied 
all manifest some motility even under 
homeostasis (or what one is tempted to 
call a “steady state” » from which 
changes in level are to be evaluated. 

2. The quantitative control of the 
Conditions (need states) which result 
in a significant change in activity from 
that which characterizes or constitutes 
the base line, : 

3. The measurement of the activity 
level. 

4. The specification of the processes 
by which need states are translated int 
overt activity. The most common aP 
proach here is to postulate an interve?” 
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ing drive state. Changes in activity level 
which are correlated with changes in 
need state are frequently attributed to 
the “energizing” function of drive 
(Bolles, 1958). 


These are not simple problems. In- 
deed there are grounds for questioning 
whether any truly unitary constructs 
are warranted at this time. Activity 
level at present defies any definition 
except in terms of measurements ob- 
tained from various activity wheels, 
stabilimeters, photoelectric devices, 
open-field tests, etc. As we shall see, 
the data obtained with one type of meas- 
uring device are not necessarily the same 
as those from another device under the 
same need states. 

The aim of this paper is to summarize 
and to evaluate some of the experimen- 
tal literature in which there has been 
an attempt to relate activity level to 
need state. 


Foop RESTRICTION 


The most intensively and carefully 
studied need states, as they relate to 
activity, are hunger and thirst. Com- 
mon observation and experience tell one 
that as an animal becomes hungry or 
thirsty it behaves in a manner that typi- 
cally relieves this condition; that is, it 
reduces the physiological deficit and re- 
turns to a state of relative equilibrium. 
The animal will run about, pacing, sniff- 
ing, and in general will appear to ex- 
plore its environment. Observation of 
infant animals indicates that they dis- 
play considerable random activity even 
after short periods of food and water 
deprivation. Obviously, an organism 
must show some mobility (at least in 
its natural habitat) in order to survive. 
Research findings tend to support in- 
formal and common-sense observations 
that rate of response is related to se- 
verity of the deprivation. Activity, thus, 


has become a measure of drive. Or, 
stated more appositively, heightened 
activity level in the presence of an 
altered need state has been attributed to 
the energizing function of drive. 

Many investigators have shown that 
activity (mainly in a running wheel) 
increases up to a point as a function of 
food and water deprivation. One of the 
clearest examples of this relationship 
is found in the work of Moskowitz 
(1959), who reports that an almost 
linear function obtains between running 
wheel activity in the rat and decrease 
in body weight as controlled by food 
and water deprivation. Other studies 
(eg., Finger, 1951; Hall, 1956) have 
shown that activity in wheels increases 
directly as a function of the number of 
hours that rats are deprived of food. 
An increase in activity in relation to 
food deprivation has sometimes been 
noted with respect to bar pressing (Se- 
gal, 1959; Webb, 1948), beam inter- 
ruptions (Siegel & Steinberg, 1949), 
open-field behavior (Dashiell, 1925; 
Fehrer, 1956), treadmill running (Baba, 
1959), and stabilimeter behavior (Rich- 
ter, 1922; Teghtsoonian & Campbell, 
1960; Teitelbaum, 1957). Irwin (1932), 
using the stabilimeter technique, found 
a significant correlation between the 
activity of human infants and lapse of 
time from a feeding period. Among the 
more well-established findings is the 
fact that animals show regular cycles of 
activity and eating. Following eating, 
activity decreases markedly (Finger, 
1949; Richter, 1927). 

Unfortunately, many qualifications 
cloud the apparent general relationship 
between deprivation and activity. 
Among the considerations that limit 
this generalization are: (a) findings 
that the respective deprivation states 
do not consistently produce similar 
changes in activity; (b) that intercor- 
relations between measurements from 
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the various devices are generally low; 
(c) that the relationship may be a a 
direct one which is, in turn, dependen 
upon stimulus conditions, learning, and 
physical condition; (d) that different 
species, perhaps even strains, show dif- 
ferent effects; (e) that sex differences 
are apparent in activity level; and (f) 
that age is a contributing factor. Then, 
too, the interactions among these factors 
must be considered. For example, it 
may be that changes in activity level as 
a function of food deprivation are de- 
pendent upon the nature of the appa- 
ratus and the stimulating conditions. 
The question arises whether depriva- 
tion per se, as manifested by changes 
in blood chemistry or some other physi- 
ological mechanism, is a direct antece- 
dent factor in increasing activity. That 
is, are increases in activity level the un- 
learned effects of the physiological con- 
sequences of deprivation, or does depri- 
vation sensitize the organism to react 
to features and changes in its environ- 
ment to which, in a satiated state, it 
does not respond? According to the 
latter point of view one might expect 
activity to remain relatively stable in 
an unchanging environment even though 
the organism were deprived. Moreover, 
the activity should increase as a func- 
tion of change in the environment and 
increased need state. This is a funda- 
mental issue. It Taises the question 
whether or not activity adequately and 
directly reflects the strength of drive, 
Campbell and Sheffield (1953) con- 
tested the classical view that activity is 
a resultant of an increased drive state 
due to a heightened condition of primary 
need. They found only minimal stabij- 
limeter increments in activity over 72- 
hour food-deprivation intervals when 
external factors related to vision and 
audition were held constant by rigid 
environmental control. They explained 
their findings by suggesting that hunger 
serves to reduce perceptual thresholds 


for external stimuli and that it is these 
stimuli, in turn, which initiate activity. 
Segal (1959) studied bar-pressing a 
havior in relation to these factors an 

came to much the same conclusions. 
This position bears a similarity to one 
advanced by Geier (1942), who con- 
tended that the tension produced by the 
expectancy of food makes the organism 
more sensitive to the stimuli impinging 
upon it. In effect, these writers reject 
the traditional notion of drive as > 
energizer and assign to it the role 0 
alerter. 

Several additional studies offer sup- 
port for the hypothesis that the primary 
condition leading to increases in depri- 
vation-related activity is external stimu- 
lation, while conditions of need merely 
Serve to sensitize the organism to stimu- 
lation. Baker (1953), in examining 
closely the cyclic nature of activity, 
has found that the characteristic cycle 
is absent if the animal is deprived and 
its environment is held constant. Lu- 
bow and Tighe (1957) reported that 
increases in illumination raise activity 
level in a stationary apparatus in E 
degree proportional to the amount O 
change in intensity. Decreases in il- 
lumination, however, appeared to have 
little effect, Similarly, Roberts (1962) 
found that stimulus changes produce 
recovery in bar-pressing behavior. Mon- 
keys showed no alteration in activity 
level up to 54 hours of food deprivation 
in a reasonably constant environmen 
(DeVito & Smith, 1959), Teghtsooniar 
and Campbell (1960) and Amsel an 
Work (1961) have compared the a 
tivity of deprived animals whose €” 
vironment was held constant with others 
subjected to changes in the level ° 
stimulation. The effect was the ame 
The nonstimulated group remained ae 
its base-line activity while the othe 
group showed substantial increases: 
Nevertheless, in mice, at least, the 
lationship between changes in illum 
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nation and activity is partly dependent 
upon strain (McClearn, 1959, 1960). It 
should be pointed out, however, that 
studies of human infants (Irwin, 1930, 
1941) suggest that their activity de- 
creases as illumination increases. Peri- 
ods of continuous auditory stimulation 
also show decreasing activity as com- 
pared with periods which contain no 
auditory stimuli. Further, if the in- 
fant is subjected to both illumination 
and auditory stimulation there is an 
even greater reduction in stabilimetric 
activity—in other words, the presence 
of these forms of energy are additive in 
reducing activity. 

It should be pointed out in regard to 
these studies that the term “constant” 
can only be applied in a relative sense. 
That is, the control animals by virtue of 
moving about in their cages, however 
minimal such movement may be, are 
changing the environment to the ex- 
tent that the stimulus conditions are 
altered. Furthermore, it is apparent 
that in spite of the most rigorous efforts 
of the experimenter to maintain a 
constant environment, small variations 
in noise, temperature, odors, etc., are 
bound to occur. 

Campbell and Sheffield (1953) fur- 
ther theorize that activity is a function 
of the stimulus effect of fractional an- 
ticipatory eating responses which are 
conditioned to the external environment. 
While it is difficult to comprehend the 
exact process by which anticipatory 
eating responses would be conditioned to 
environmental changes, particularly 
those which the animal has not experi- 
enced previously, or which it cannot 
Consistently associate with hunger re- 
duction, it does seem that prolonged 
deprivation of specific foods might bring 
about a lowering of thresholds for spe- 
Cific kinds of stimulation or that general 
deprivation might be followed by an 
Increased sensitivity to almost any 
variety of stimulation. 


Another possible interpretation is sug- 
gested by the arousal function of the 
reticular formation (Bolles, 1958), 
Centers that respond to specific deficits 
may exert a disinhibiting influence on 
the ascending reticular activating sys- 
tem, thereby leaving the animal more 
responsive to external stimulation. How- 
ever, there is not necessarily a one-to- 
one relation between events occurring 
within the central nervous system and 
overt behavior. Therefore, while 
arousal mechanisms within the central 
nervous system might provide a useful 
model on which to base a concept of 
drive, drive and arousal should not be 
considered as interchangeable con- 
structs, particularly since many of the 
antecedent conditions which define drive 
state do not seem to be reflected in 
changes in cerebral activity. 

In an effort to examine more closely 
the role of associative factors with re- 
spect to variations in activity, Sheffield 
and Campbell (1954) introduced con- 
trolled changes (an electric fan and 
lights) into the external environment 
prior to regular feeding. Greater ac- 
tivity was manifested by rats fed at 
the end of the stimulus period than by 
similarly deprived controls that were 
fed with no preceding environmental 
change. They interpreted this incre- 
ment in activity as an effect of frustra- 
tion, Such frustration was produced 
when the conditioned external changes 
occurred and when the goal, and con- 
sequently the consummatory response, 
was not possible. The stronger the con- 
nection between consummatory Te- 
sponses and change, the more frustrated 
(active) the organism will be when the 
changes occur and when there is no pos- 
sibility of the consummatory response. 
Amsel and Work (1961) repeated the 
experiment by Sheffield and Campbell, 
but included another group of animals 
which were fed on half of the stimulus- 
change trials. Partial reinforcement of 
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ivity, while producing changes, does 
E A in the sharply ascending ac- 
tivity curve which characterizes the 
100% group. Amsel and Work sug- 
gested that the 50% group did not have 
a sufficient number of trials to develop 
the 7, mechanism accounting for an- 
ticipatory goal reactions. Thus, the 
frustrating effects of nonreward were 
not as great in this latter group. 

But, it is not clear whether the frus- 
tration affects activity by directly in- 
creasing drive, or whether various motor 
responses are learned in connection 
with frustration-produced proprioceptive 
stimuli. One is inclined to wonder just 
how the notion of frustration-produced 
activity relates to the concept of sen- 
sitization, unless one accepts the im- 
plicit suggestion that deprivation sensi- 
tizes the organism to various cues which 
are frustrating, 

In the interests of parsimony, it 
might be suggested that this phenome- 
non can be explained without reference 
to states of “frustration” or of “ten- 
sion.” Some activity inevitably precedes 


eating and it is Suggested that this ac- 
tivity is reinforced by c 


responses. By virtue of fe 
mal, its mother or 


would seem to occur very early in the 
animal’s experience. Furthermore, ace 
tivities more and more remote in time, 
and consequently in adaptiveness, are 
linked together through a process of 
higher order conditioning and generali- 
zation. This should produce a chain of 
conditioned responses, as a function of 
the reinforcement contingencies and 
magnitude of deprivation, which would 


gradually build up to reach its pe in 
magnitude just before reinforcemen : 

The total stimulus complex to which 
the activity response is conditioned E 
sists of two sets of cues: those provide 
by the external environment and those 
which result from physiological mecha- 
nisms activated by the need state. For 
example, the introceptive cues sN 
ated with water deprivation would arise 
from stimulation in the pharyngeal re- 
gions, the stomach, and the hypothala- 
mus. The internal stimuli associated 
with a given level of need differ pon 
those of any other level of need, bot 
with respect to number and intensity. 
The greater the need state, the more 
intense and numerous the introceptive 
cues, 

Thus, changes in activity may be s 
function of the degree to which the tota 
maximal stimulus complex is represented 
at a given moment by the available cues. 
It would follow from these notions that 
the general activity level should be 
raised by changes in the external ioe 
ronment, but that deprivation should 
increase it even more. This, of gours 
is not what Campbell and Sheffield 
have suggested. Nevertheless, they do 
report significant increases in activity 
in the nonstimulated group. Another 
study (Hall, 1956) clearly substanti- 
ates our hypothesis. One might also 
predict that the shape of activity ai 
plotted as a function of length of f008 
deprivation would be different for samy 
lated and nonstimulated groups. ae 
Cause there are fewer maintaining eel 
uli in the latter condition, deceleratio” 
should occur more rapidly and an asymP 
tote reached sooner, The extinguishing 
effects of nonreward of activity shou 
be more rapid in the nonstimulatea 
group. Finally, one might hypothes! : 
that following eating, when the nee 
related cues are virtually elimina et 
activity level would be at its low 
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This particular phenomenon has fre- 
quently been observed. 

In some respects these suggestions are 
similar to those advanced by Estes 
(1958). Estes, however, refers to drive 
stimuli rather than to need-related cues. 

Finger, Reid, and Weasner (1957, 
1960) have demonstrated that manipu- 
lation of the reinforcement schedule 
alone modifies running activity. By 
feeding one group of subjects immedi- 
ately after removing them from the 
wheels and another group 60 minutes 
later, these workers were able to pro- 
duce differences not only with respect 
to total activity but also in terms of 
its pattern of development. Moreover, 
this phenomenon occurs under condi- 
tions of relatively low need. Hall (1958) 
has confirmed the finding by Finger 
and his associates and suggests that 
activity-wheel behavior is not a suita- 
ble indicator of drive strength when a 
deprivation-feeding cycle is used. Sew- 
ard and Pereboom (1955) have reported 
similar results. Apparently, activity- 
wheel running is subject to many of 
the same laws as other forms of instru- 
mental behavior, Again, in some of 
those studies in which exploratory be- 
havior has been related to deprivation 
states (e.g., Alderstein & Fehrer, 1955), 
learning has not been ruled out as an 
explanatory factor. Particularly im- 
portant here is the schedule of depriva- 
tion and ingestion with respect to meas- 
urement of exploration. In an attempt 
to control for the effects of deprivation 
Schedules, Glickman and Jensen (1961) 
have found that moderate degrees of 
hunger (22-hour period) do not pro- 
duce significant changes in exploratory 
behavior, but that a 46-hour period of 
deprivation does lead to an increase in 
Maze exploration. It is interesting to 
Note from their data that similar periods 
Of thirst did not result in significant 
Changes in exploratory behaviors. 

Even in the instance where there is 
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suitable experimental control for the 
reinforcing effects of posttest feeding 
on activity, increases in activity might 
still be attributable to learning by virtue 
of cues specific to a particular degree of 
privation. Thus, for example, the rat 
may have learned that a vigorous re- 
sponse to intense need-produced stimuli 
is more adaptive than a weak response, 
both in terms of time and amount. Jen- 
kins and Hanratty (1949) have shown 
that rats can learn to respond differently 
to different degrees of deprivation. In 
short, it is probably safe to say that 
the rats’ early experience with depriva- 
tion and feeding have provided that 
basis for learning to be active in re- 
sponse to cues associated with depriva- 
tion. 

In any event, Campbell and Shef- 
field’s (1953) interpretation could be 
an important one for a theoretical ap- 
praisal of activity since it conceives of 
activity as dependent upon external fac- 
tors and learning as well as upon need 
states. A further and perhaps more 
important implication is that activity 
can be viewed as directional, however 
inappropriate the response may appear 
to be. Activity, in other words, may 
not be spontaneous, but rather general- 
ized approach or avoidance behavior. 
It is possibly best to regard such be- 
havior as instrumental. If one holds 
to a drive interpretation of activity, then 
in terms of the results of the investiga- 
tions of Campbell and his associates it 
would appear that the role of drive is 
merely that of an alerting mechanism. 

Nevertheless, a portion of the litera- 
ture, if not directly refuting the sensi- 
tization hypothesis, at least fails to 
confirm it. Hall and Hanford (1954) 
controlled the external environment by 
holding light and temperature constant 
and by deafening their animals. They 
reported that wheel activity eee 
as much in the experimental animals 
as in the control group. Hall, Smith, 
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Schnitzer, & Hanford (1953) also found 
a relationship between activity and 
deprivation independent of external 
sources of stimulation. The results of 
a later study by Hall (1956) have al- 
ready been mentioned. Part of the 
diffculty in evaluating the literature 
stems from procedural differences both 
in terms of the actual measurements and 
also with respect to the conditions un- 
der which they were obtained. Surely, 
it is fairly well established that a rela- 
tionship exists between food deprivation 
and activity level, But, whether this 
occurs independently of the external en- 
vironment, of learning, and of other fac- 


tors, is another, and as yet, unresolved 
question. 


Foon versus WATER RESTRICTION 


Most of the literature which we have 


reviewed has dealt with food deprivation 
upon activity, What is the situation 
i Tespect to restriction of water? 
Finger and Reid (1952) found that 
water restriction affected Tunning ac- 
tivity as much as did food deprivation 
with performance Curves increasing con- 
tinuously under both conditions up to 
72 hours. A similar finding is reported 
by Stevenson and Rixon (1957). How- 
ever, the “satiation syndrome,” a term 
coined by Finger (1949) to describe 
the typical depression of activity follow- 
ing relief of deprivation, is less apparent 
under conditions of water restriction, 
This suggests that stimulation associ- 
ated with digestive activity may be a 
factor relevant to overt activity. 
There is a considerable lack of con- 
sistency, however, in the research find- 
ings with respect to comparisons of 
these need states. For instance, Hall 
(1955) did not observe any increase in 
running activity under prolonged re- 
striction of drinking. This confirms an 
earlier finding by Richter (1922). Simi- 
larly, Bolles (1959) reported increases 
in running related to food deprivation 


but not with respect to thirst. Stabi- 
limeter measurements indicate, aegon 
ing to Campbell (1960) that oa 
introduced just prior to drinking or eat 
ing result in heightened activity level, 
whereas water deprivation alone | 
duces no noticeable effect. This result 
is cited as further support for the sen- 
sitization-frustration theory. , 

It should be noted that a compleat 
ing factor arises in the employment e 
either food or water deprivation in tha 
when one of these is restricted, pane 
sumption of the other falls below ae 
intake. Thus a food-deprived animal, 
in terms of his base-line consumption of 
water, is also thirsty, and vice versa. 

In addition, there is no reason to sup- 
pose that a given period of food depri- 
vation is equal in its drive consequences 
(or in its stimulus value) to a —_ 
period of water deprivation, Dufort an 
Blick (1962) have shown that rats ad- 
just much more rapidly to water depri- 
vation than to food deprivation. It K 
clear that there is an abundance “1 
methodological problems present an 
any general statement regarding the 
relative effects of food and water depri- 
vation upon activity must be withheld. 
There is a critical need for research 
into the mechanisms by which the an 
ferent need states affect the interna 
receptors of the organisms in overlap- 
ping and in separate ways. 


SPECIFIC NUTRIENT DEPRIVATION 


Specific-nutrient-deprivation studies 
have been reviewed by Spector p 
Young (1957). In general, there seem 
to be an initial increment in activity 
related to internal and/or external ae 
ulation. This activity leads to ar 
depletion and, if there is no repair, | 
exhaustion with a decrease in actin 
(Wald & Jackson, 1944). However, t 
picture is not universally as simple 4° 
this. 


Reed (1947) and Munn (1950) have 
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indicated that activity is related to the 
percentage of proteins in the diet. Tran- 
sitory increases in activity seem to fol- 
low decreased protein intake. 

There are numerous examples of the 
effects of specifically restricted diets 
upon the activity of animals. Richter 
(1957) reported that rats are most 
active on a diet of carbohydrates and 
dextrose, less active on fats and lards, 
and least active on protein and casein. 
Animals deprived of thiamin and ribo- 
flavin, according to Wald and Jackson 
(1944), show an increase in activity. 
But when given sufficient amounts of 
these vitamins they tend to become 
more quiescent than the controls, Re- 
striction of vitamin B in the diet leads 
first to increased activity and then, 
after a few days, to lessened activity. 
On the other hand, deficiency of either 
vitamin A or vitamin D appears to re- 
sult in an almost immediate decrement 
in activity (Richter & Rice, 1942, 
1943). 

These divergent findings are trouble- 
some for drive interpretations of ac- 
tivity. While the composition of the 
diet does seem to influence gross bodily 
activity, the changes appear to be highly 
irregular. Moreover, there are some 
types of dietary deficiencies which re- 
sult in an immediate decrement in ac- 
tivity and some which have no apparent 
effect. These findings serve to indicate 
the complexity of the activating mech- 
anisms. While there is no requirement 
that drive and need be regarded in a 
One-to-one relation, neither is it very 
Satisfying to explain the absence of an 
increase in activity by referring to a 
Particular type of need which does not 
Initiate drive. Perhaps it would be more 
appropriate to refer to the stimulus 
Properties of the various need states. 
Indeed, it might be suggested that the 
Internal cues arising from total food 
deprivation are an accumulation de- 
riving from the numerous specific needs. 
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Thus, one might suspect that the in- 
ternal stimuli associated with food dep- 
rivation are more numerous, but more 
variable from moment to moment than 
those resulting from water deprivation. 
These conditions may account for the 
more rapid adaptation to water depri- 
vation as well as the greater increase 
sometimes reported in activity associ- 
ated with food deprivation, 


Sex 


Activity level appears to be related 
to sexual need, as defined by estrous 
variations. At the peak of the 4-5 day 
estrous cycle, running activity is several 
times as great as it is during other 
portions of the cycle (Finger, 1961b). 
However, when pregnancy occurs and 
there are no further periods of recep- 
tivity, activity drops to a consistently 
low level (Brobeck, Wheatland, & 
Strominger, 1947; Wang, 1923). After 
sex glands are removed, thus abolishing 
the female cycle, activity is reduced 
to a permanently low level (Richter, 
1927). 

Brody (1942) and Finger (1961b) 
have shown that, in comparison to 
males, female rats are four times as 
active in revolving drums and twice as 
active in stationary cages. While this 
difference may be explained, in part, 
by the increased activity of the females 
in estrus, the consistency of the dif- 
ference lends support to the view that 
other factors are involved. Apparently, 
this finding cannot be extended to 
humans. Wolfensberger, Miller, Foshee, 
and Cromwell (1962) have taken bal- 
listographic measures of activity in 
junior and senior high school students. 
They report that boys are more active 
than girls. Although this sex difference 
is a common observation in young chil- 
dren, cultural factors may be contribut- 
ing to the picture here. Schulman, Lip- 
kin, Clarinda, and Mitchell (1961) have 
obtained essentially the same results 
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ing the actometer as the measuring 
device. Childers (1935), on the: other 
hand, found no sex or race differences 
i ivity level. 
in | ann placed in an activity wheel 
next to a female will display peaks of 
activity related to the female’s estrous 
period. When a number of females are 
placed in his vicinity, the male’s ac- 
tivity increases, but it is at a more con- 
stant level (Reed, 1947). One explana- 
tion of this finding is that the females 
are in heat at different times and the 
male responds to all of them. A com- 
plicating factor here for exponents of 
drive-reduction theories is the finding 
that copulation does not serve to re- 
duce the running-wheel activity of the 
male (Morgan & Stellar, 1950). Nor 
does there seem to be much connection 
between running-drum activity level 
and the sex drive, as measured by the 
readiness of the male to copulate with 
receptive female (Stone & Barker, 
1934). In this instance, then, there 
does not appear to be any relation be- 
tween drive state and activity, at least 
as measured by the activity wheel. 
As contrasted with the relation of 
activity to nutrient deprivation, the 
Causative factors are less elusive in 
the female sex drive to activity. It is 
quite clear that sexual hormones play 
a substantial role. Both sexual recep- 
tivity and sexually linked activity are 
probably based on a direct effect to 
central structures by chemical condi- 
tions determined in the ovaries (Morgan 
& Stellar, 1950). Hormonal conditions, 
likewise, are probably implicated in sex- 
related activity of the male. 


NEED For ACTIVITY 


Up to this point, we have been dis- 
cussing activity as an indicator of need, 
The question arises, however, whether 
activity can also be a drive. That is, 
does a condition of restriction of move- 
ments for a period of time produce extra 


activity when the restriction is removed? 
The typical research paradigm followed 
is that of confining rats and studying 
the effects of the confinement on subse- 
quent activity. If there is such a state 
as an activity “drive,” then the confined 
animal should display more activity at 
the end of the confinement period, the 
level of activity being a function of the 
length of confinement. 

Some investigators have indicated 
that activity increases following confine- 
ment (e.g., Baron, Antonitis, & Beale, 
1961), but the effects are not clear-cut. 
Siegel (1946), for instance, found that 
24-hour confinement leads to an in- 
crease in activity, but 6- to 12-hour 
confinements were followed by decreases 
in activity. Siegel and Alexander 
(1948), in repeating Siegel’s (1946) 
earlier study, found that the relation- 
ship between enforced confinement and 
activity is dependent on the time of 
the day. The 6-hour confinement period 
has no effect if the confinement occurs 
from early to late morning, a period 
during which the rat is normally inac- 
tive. However, the depressing effect 
does occur if confinement is imposed 
between late morning and early eve? 
ning, when the laboratory rat is rela- 
tively more active. Hill (1956, 1958, 
1961), attempting to demonstrate the 
generality of the relationship, has shown 
that activity in the wheel, in a tilt appa- 
ratus, and in number of squares trav- 
ersed in an open field increases among 
tats according to the length of confine- 
ment within the range of 5 hours tO 
40 days. The age of the animal is ap- 
parently a contributing factor here, 
there being an interaction between age 
and length of deprivation. Hill con- 
cluded that his results support an ac- 
tivity drive conclusion. On the other 
hand, it should be added that the studies 
by Montgomery (1953a, 1953b) led him 
to suggest that activity deprivation pro~ 
duces no increase in exploratory be 
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havior. The weight of the evidence sug- 
gests that when animals are not allowed 
to behave in certain ways, the oppor- 
tunity to do so produces an increase in 
such behaviors. 

Apparently, behavior is often ener- 
gized in situations where the contribut- 
ing antecedent factors are as yet un- 
known. This is apparent in the play 
activity of animals (Beach, 1945). Rats 
will run although their needs seem to be 
well satisfied. Children are active even 
when physiological needs appear to be 
at a minimum. Monkeys evidently will 
learn with nothing more than manipula- 
tion serving as a reinforcement (Har- 
low, 1954). Exploration may motivate 
jearning (Montgomery, 1954), as does 
curiosity (Berlyne, 1955). Curiosity, 
manipulation, and exploratory behaviors 
are viewed by some as possible special 
cases of an activity drive (e.g., Morgan, 
1957). It should be pointed out that 
curiosity, manipulation, exploratory, and 
activity drives are overlapping constructs 
when they are reduced to operational 
terms. 

Kagan and Berkun (1954) and 
others have produced further support 
for a drive interpretation of activity 
by showing that it can serve as rein- 
forcement for learning. In their study, 
an activity wheel was designed so that 
the animal could release brakes on the 
wheel by pushing a lever. By releasing 
these brakes, the animal could turn the 
wheel for a period of 30 seconds until 
the brakes were reapplied. Animals un- 
der these circumstances responded con- 
siderably more than did rats in freely 
moving wheels. Unfortunately for a 
drive interpretation of activity, it has 
been shown (Hill, 1961; Montgomery, 
1954) that confinement does not lead to 
a substantially greater preference of 
an activity incentive. 

These experiments indicate that, in 
spite of the lack of nutrient-based cues 
or intense sensory stimulation, a need 
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for activity seems to be manifested. 
However, it is not clear what role as- 
sociative factors play. Conceivably, 
cues arising from confinement or inac- 
tivity (need for exercise) may be asso- 
ciated with various types of overt motor 
responses. For instance, normal circu- 
lation of blood is partly dependent upon 
muscular contraction. 

There can be little doubt that the 
need states resulting from food, water, 
and sex restriction through their cen- 
tral mechanisms, energize sensory-motor 
systems. Some of these same structures 
may be involved in the activity drive. 
There is considerable literature (Linds- 
ley, 1957) which indicates that the 
reticular activating system and the hy- 
pothalamus are involved in the activity 
need. Vertebrates with damage in these 
areas show altered activity levels. Ani- 
mals with lesions in their “sleep center” 
show increased activity. Damaged 
“wake center” animals show lessened 
activity. Insult to the frontal lobe and 
in the area of the corpus striatum is 
associated with increased activity level. 
The work of Campbell and his asso- 
ciates (Campbell & Sheffield, 1953; 
Sheffield & Campbell, 1954; Teght- 
soonian & Campbell, 1960) suggests 
the involvement of the ascending retic- 
ular activating system as a mediating 
center. 


METHODOLOGICAL PROBLEMS 


As indicated earlier in this review, 
a portion of the conflicting evidence 
might be explained on the basis of differ- 
ences in methodology. The devices most 
frequently used in measuring activity 
are the stabilimeter and the wheel. Each 
of these has undergone many modifica- 
tions. Only a limited amount of investi- 
gation of the interrelations among vari- 
ous measures of activity has been at- 
tempted. The information which is 
available is sufficient to suggest that 
activity level should not be viewed as a 
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: or homogeneous phenomenon. 
TN writers (e.g., Morgan & Stellar, 
1950; Reed, 1947) have suggested sev- 
eral dimensions of activity—locomotor 
versus diffuse, relevant versus irrele- 
vant, and goal-directed versus non-goal- 
directed. Reed (1947) comments that 
“what data we have point to more than 
one type, or at least more than one as- 
pect, of activity.” Hunt and Schlosberg 
(1939) have made a similar statement. 
How do the various measures com- 
pare with one another? Some studies 
(Bolles, 1959; Furchgott & Echols, 
1958; Hayes, 1957; Stern, 1957) in- 
dicate that open-field behavior is distinct 
from wheel running and stationary cage 
activities. Hunt and Schlosberg (1939) , 
studying castration, and Tainter (1943), 
investigating various drugs, reported 
that their experimental treatments dif- 
ferentially affected stabilimetric and 
running-wheel activity. More recently, 
Strong (1957) has found that food dep- 
rivation will lead to an increase in 
diffuse activity as recorded by a sensi- 
tive contact stabilimeter, but if a micro- 
switch stabilimeter is employed, food- 
deprived rats do not difi 


fer significantly 
in activity from their controls or in 


relation to predeprivation level. Alon 
these same lines, Finger (1958) meas- 
ured activity with both the revolving 
drum and photoelectric techniques and 
found that the drum measure was more 
influenced by increases in food depri- 
vation. In another study, Weasner, 
Finger, and Reid (1960) have shown 
that with different deprivation regimens 
the running increases proportionately 
more than does activity in a stationary 
device. Treichler and Hall (1962) have 
compared activity in the wheel, stabi- 
limeter, and Dashiell Maze as a func- 
tion of body weight. They concluded 
that the three measures do not produce 
interchangeable data, the stabilimeter 
and maze scores showing little change 
as a result of weight loss. 
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The picture is much the same when 
need variables other than food restric- 
tion are used. For instance, Finger 
(1961b) has demonstrated that wheel 
activity is more responsive to estrous 
variations than stationary cage activity. 
A further complication is introduced by 
the finding that the difference beteen 
these two measures, as a function D 
estrus, is not as great as the disparity 
noted when the animals are subjected to 
food deprivation. In other words, we 
may infer a triple-order interaction be- 
tween need states, degree of privation, 
and measuring device. 

Strong (1957) has claimed that the 
stabilimeter type of measurement used 
by Campbell and Sheffield (1953) de- 
tects and records relatively more diffuse 
activity of fine movements than some 
of the methods employed by others. 
Hunger appears to increase locomotor 
activity as measured by the wheel and 
photoelectric cages, but seems to have 
considerably less effect on stabilimeter 
activity. On the other hand, it may be 
that external stimulation increases rest- 
less behavior to which the stabilimetet 
is more sensitive. Thus, it seems that 
the conflicting results of Campbell and 
Sheffield (1953) and Hall (1956) are 
explicable on the basis of differences in 
the recording of activity changes. In 
this connection, it may be argued that 
wheel rotations give a superior method 
of quantification due to the inherent 
Specificity of response. Two units © 
increase, for example, in an activity 
wheel allow one to assume that their 
combined “activity” is approximately 
twice as much as a l-unit increase- 
However, in the stabilimeter this 45° 
sumption is not nearly as tenable va 
cause the score producing response act 
themselves are not truly additive. ches 
argument would apply to the photoele¢ 
tric devices as well. P 

Another feature which should re 
noted is the relative similarity of th 
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stabilimeter to the home environment 
of the typical laboratory rat. The re- 
sponses which the rat is physically able 
to make are virtually identical in the 
two situations. However, the wheel 
makes it possible for the animal’s be- 
havior to take more complete and less 
restricted locomotive forms, rather than 
limiting movements to postural re- 
sponses. This difference may very well 
account for the finding (Eayrs, 1954) 
that habituation to the stationary cage 
occurs much more rapidly than adapta- 
tion to the revolving drum. 

A related methodological problem is 
the determination of the base line for 
the activity of a given animal. By first 
measuring the normal activity of the 
organism it is possible to estimate more 
precisely the effect of a particular treat- 
ment. The base line is usually considered 
established when there is no consistent 
change over a period of time, inde- 
pendent of the normal cyclic behavior 
and aging process. When the habitua- 
tion requires an extended period, this 
latter factor might be important, al- 
though it is rarely considered. 

There is considerable interexperi- 
menter variability with respect to what 
is considered necessary to establish a 
suitable base line. Campbell and Shef- 
field (1953), for example, used much 
shorter intervals (10 minutes a day for 
7 days) than most other researchers. 
Finger (1961a) and Reid and Finger 
(1955) have demonstrated that base- 
line activity in the running drum takes 
at least 30-35 days to reach a stable 
level. Moreover, the cyclic feeding ex- 
periences of the rat before it has ever 
come into contact with the wheel com- 
plicate the matter. If the animal is 
placed on cyclic deprivation prior to 
wheel running, the base-line function 
becomes fairly level within 15 days 
(Reid & Finger, 1957). The time 
required to establish a reliable base 
line partially dependent upon the 


apparatus employed—stabilimetric-type 
cages requiring only a few days. In 
most studies, at least a portion of the 
variance in activity scores is probably 
due to habituation factors. 

Still another factor which must be 
considered in an attempt to compare the 
different experiments relates to differ- 
ences in deprivation schedules. Among 
the determinants of such schedules are 
the amounts of food permitted at a 
time, the interval between feedings, and 
feeding in relation to the time of activity 
measurement. Hall and Connon (1957) 
predicted that different schedules, with 
the same amount of deprivation, would 
result in different amounts of activity. 
They found substantial differences in 
activity as a function of the method of 
handling the deprivation schedule. There 
is some evidence to suggest that the ratio 
of weight loss to predeprivation weight 
is more crucial than deprivation per se. 
Moskowitz (1959) was among the first 
to report that activity does not increase 
until body weight has decreased to 85- 
90% of normal. After this point there 
is almost a perfect correlation between 
wheel running and deprivation (prior to 
inanition). 

Evaluation of the relationships or 
lack thereof between need states and 
activity level becomes all the more 
difficult when organismic variables are 
considered. Species differences (DeVito 
& Smith, 1959), strain differences (Hol- 
lifield & Parsons, 1957; McClearn, 
1960), sex (Baba, 1958; Finger, 
1961b), and age (Finger, 1962) have 
been shown to interact with either the 
need-activity relationship or with the 
measuring technique, or with both, In 
his rather extensive study of the effects 
of varying degrees and types of depriva- 
tion on different behaviors, including ac- 
tivity measures, Bolles (1959) comes to 
the conclusion that individual perform- 
ance is dependent upon situational fac- 
tors. There is very little communality 
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of performance across different situa- 
tions and conditions of deprivation. 
Bolles (1959) states: “AN of the evi- 
dence supports the hypothesis that mo- 
tivated behavior is controlled chiefly by 
associative factors [p. 585].” : 
It might be well in studying activity 
to attempt a more refined measurement 
of the various types of movements the 
animal is making under definable need 
states. Perhaps such studies might re- 
veal that what we now call general ac- 
tivity level is merely an artifact of in- 
adequate measuring methods, and that 
activity is actually highly specific. 
Close analysis of the Separate responses 
which make up the activity score may 
very well show that the nature of the 
activity being manifested is a function 
of the stimulus effects of different need 
states, 
Bindra (1961) has recently attempted 
a formulation which is more concerned 
with the specificity than the generality 
of activity. He points out that con- 
Summatory acts must be considered in 
relation to the occurrence or nonoccur- 
rence of the more pervasive and fre- 
quent spontaneous acts which comprise 
general activity level. He pointed out 
that the roles of irrelevant and relevant 
activity should not be ignored in analyz- 
ing the total activity matrix, Failure to 
do this may very well explain some of 
the low correlations between different 
measures of activity. For relevant and 
irrelevant response acts may display 
different prominence in each measure. 


CONCLUSIONS 


There can be no doubt that conditions 
of need such as food, sex, rest, etc., 
influence the activity level of the organ- 
ism. Most of the organic conditions are 
correlated with increases in general ac- 
tivity level. This phenomenon has been 
demonstrated many times. Moreover, 
increment in activity, up to a point, are 
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correlated with the degree of deficit. 
However, there are two crucial problems 
which as yet have no definite solutions. 
One involves the circumstances under 
which the relationship remains invari- 
ant, and in effect, constitutes the 
broader problem of definition of ac- 
tivity. The other one concerns the 
process by which need states are re- 
flected in changes in activity. 

The research findings reviewed in 
this paper are sufficient to indicate that 
any attempt to write a formula for 
activity output in terms of a known 
level of deprivation would also have to 
take into account the measuring device, 
the type of deprivation, the schedule of 
deprivation, the immediate environ- 
ment, the experiences of the organism, 
the sex, age, and species (probably 
Strain also), and the interactions be- 
tween these factors. Activity, far from 
being a simple indicator of drive, is 
probably as complex a phenomenon as 
we have ever undertaken to measure 1n 
the laboratory. Instead of dealing with 
activity as a homogeneous, unitary 
construct, we suggest that psychologists 
should be devoting more efforts to the 
discovery of the parameters of various 
subclasses of activity, rather than as- 
suming that they are measuring some- 
thing which is general in nature and 
unitary. It is doubtful that anything is 
to be gained by attempting explanatory 
constructs that will cover the wide range 
of behavior included in this broad cate- 
gory, and the conditions under which 
these behaviors develop. 

Probably the most commonly accepted 
view of need-related activity is that it 
represents the energizing function of 
drive. Accordingly activity has fre- 
quently been used to infer the presence 
of heightened drive state. However, 10 
many instances, particularly in rela- 
tion to food and water deprivation, a? 
associative interpretation can also be 
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made. In fact, we suggest that as more 
is discovered about the role of learning 
with respect to so-called “spontaneous 
activity” there will be less need to refer 
to drive concepts. The use of constructs 
such as “drive” to explain certain be- 
haviors seems to be inversely related to 
an understanding of the conditions un- 
der which these behaviors develop. We 
do not believe that the concept of drive 
will survive the discovery of the nature 
and the measurement of the antecedent 
organismic conditions which determine 
an increase in the general innervation 
of muscles. Learning may be the source 
of some of these conditions. 
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Taylor (1963) pointed out (a) results of studies of universal phonetic 
symbolism in natural languages were contradictory (b) Taylor and Taylor 


meanings to the same sounds. She argued, therefore, that current theories 
of phonetic symbolism, which imply its universality, must be revised, and 


offered a theory based upon the occurrence of certa 


dominantly in some meaning categories than in othe 
however, that Taylor overlooked important meth 
“contradictory” studies and did not take into acco 
to those of Taylor and Taylor. In addition, the “la 


is examined and variou; 
pointed out. 


In a recent review of the literature, 
Taylor (1963) argued for a basic reyi- 
sion of phonetic symbolism theories be- 
cause they imply that the phenomenon 
is universal and experimental evidence 
has failed to bear this out. Specifically, 
Taylor argued that theories which postu- 
late that the sound-meaning relations to 
which phonetic symbolism refers are 


lage, attach the 
s. However, the 
ned to test this, 


have, according 
to Taylor, been either inconclusive or 


directly in opposition to the hypothesis. 
For example, in one type of study of 
universal phonetic symbolism in natural 
languages, subjects may be asked to 
guess the meanings of foreign words by 
matching the corresponding members 
of word pairs in the subjects’ native 
language and in some noncognate for- 
eign language. Where no structural 
cues can be demonstrated, correct 
guessing is assumed to be based upon 
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in sounds more pre- 
ts. It is pointed out, 
odological factors in 
unt findings contrary 
nguage habits” theory 


s untested assumptions and other difficulties are 


the subjects? response to the meaning 
implications of sounds of the foreign 
words. In two out of six such studies 
reviewed by Taylor (Brackbill & Little, 
1957; Maltzman, Morrisett, & Brooks, 
1956), essentially negative results led 
the investigators to conclude that pho- 
netic symbolism was no symbolism at 
all. In a second type of study, designed 
to test the universal-phonetic-symbolism 
hypothesis directly, Taylor and Taylor 
(1962) asked subjects in four different 
language communities to rate 144 non- 
Sense syllables on four dimensions of 
meaning. Results seemed to indicate 
that the Same sounds had different 
Meanings for people who speak different 
languages, 

On these grounds, then, Taylor 
(1963) postulates that phonetic symbol- 
ism, rather than being universal, must 

e culturally delimited and therefore 
based upon “some factors that are dif- 
ferent from one language (community) 
to another unrelated language (com- 
munity) [p. 206].” She postulates that 
the likely factor is language habits. The 
Specific language habit which Taylor 
Proposes derives from what she be- 
lieves is an established observation, 
namely, that words sharing a common 
connotation tend to have the same ini- 
tial consonant, This consonant, accord- 
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ing to Taylor, through association with 
the connotation of the words in which 
it occurs takes on that connotation. 
Thus, English-speaking people learn 
that c is a “big sound” because in Eng- 
lish c is more often the initial consonant 
of words connoting “bigness” than of 
words connoting “smallness.” Similarly, 
T connotes “smallness” because it is 
more often the initial consonant of 
“small words.” 

Having summarized Taylor's position, 
we may proceed first to examine the 
grounds on which Taylor’s objection 
to current theories is based and then 
undertake to evaluate Taylor’s “lan- 
guage habits” theory in its own right. 


THE GROUNDS FOR TAYLOR’S 
OBJECTION 


First, the negative results in the word- 
matching studies of Brackbill and Little 
(1957) and Maltzman et al. (1956) 
may be considered. In the opinion of 
the present author these results are 
taken uncritically by Taylor as evi- 
dence against universal phonetic symbol- 
ism. For, as Brown and Nuttall (1959) 
and Weiss (1963) have pointed out, 
both Brackbill and Little and Maltzman 
et al. introduced important procedural 
variables which may have accounted for 
their results. Thus, Brackbill and Little 
asked subjects to tell if a single stimu- 
lus and a single response word were the 
“same” or “different” in meaning, and 
thereby effectively reduced to one fourth 
the amount of information available to 
the subject in this situation as opposed 
to one in which word pairs were used. 
Maltzman et al., on the other hand, 
used foreign words in both the stimu- 
lus and response categories, and Weiss 
(1963) has shown that it is necessary 
that the words in at least one of the 
categories be meaningful to the subject. 
Nowhere does Taylor consider these 
points. On the contrary, her observa- 
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tion that “the more recently the ex- 
periments were done the less the degree 
of positive results,” and her suggestion 
that “perhaps more recent experiments 
have tended to eliminate sources of 
spurious positive findings [1963, p. 
201]” may betray a bias. Thus, the 
possibility of “spurious negative find- 
ings,” if findings can ever be considered 
spurious at all, is not considered in Tay- 
lor’s evaluation of the evidence. 
Second, as far as the Taylor and Tay- 
lor (1962) study is concerned, the find- 
ings do indeed present a difficulty for the 
concept of universal phonetic symbol- 
ism. Thus, in word-matching studies 
the interpretation of correct guessing is 
that while different languages may use 
different sounds to convey the same 
meaning, the symbolic values of the 
sounds in a foreign word are universally 
shared and therefore make it possible 
for nonspeakers of the foreign language 
to guess the meaning of that word. Tay- 
lor and Taylor’s findings, however, make 
this seem an unlikely possibility. Is 
there an explanation, then, which can 
reconcile both findings without incor- 
porating the notion of universal phonetic 
symbolism? One attempt has been 
made by Brackbill and Little (1957) 
who argue that the choices in word- 
matching experiments are based upon 
subjects’ perception of structural simi- 
larities between the words guessed to 
have the same meaning. If this were 
true, it would indeed argue against the 
appropriateness of considering the sub- 
jects’ responses in that paradigm as be- 
ing based upon phonetic symbolism. 
However, it must be recognized that 
while the argument may explain sig- 
nificant agreement, it does not explain 
correct agreement. In other words, it 
does not explain how it happens that 
different languages make use of simi- 
larly constructed words to convey the 
same meaning. In fact, it is the opinion 
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of the present author that structural 
similarity in historically unrelated words 
of the same meaning is in itself a potent 
argument for some process of symbol- 
ism. Therefore, unless one were willing 
to grant that the positive findings of 
word-matching studies are indeed spu- 
rious, an alternative theory remains to 
be advanced which could accommodate 
both positive findings in word-matching 
studies and findings such as those of 
Taylor and Taylor (1962). It should 
be noted, however, that there are at 
least two very recent findings which 
are in some measure contrary to those 
of Taylor and Taylor. Davis (1961) 
found that African children who had 
never been exposed to English tended 
to assign the nonsense words TAKETE 
r and rounded 


Japa- 
ue that Miron’s 
(1961) Japanese subjects had varying 


degrees of familiarity with English, he 


to phonetic Properties proves to be highly simi- 
lar across two contrasting linguistic groups sug- 
gests that the laws governing phoneti 
ism may have a universal charact 


In light of these findings one is led to 
speculate that since the theory of pho- 
netic symbolism does not specify that 
a single meaning becomes associated 
with a single sound, but rather that 
many meanings may become hierarchi- 
cally associated with a sound, and vice 
versa, the hierarchies established by 
different cultural groups may differ, pos- 


sibly in part as a result of exposure to 
such language habits as Taylor (1963) 
describes. Thus, in the Taylor and Tay- 
lor (1962) study the meaning attached 
by each language group to a given sound 
may simply have been the dominant 
meaning in its hierarchy of associations 
to that sound. While this explanation 
is not entirely satisfactory in light of 
the fact that Taylor and Taylor (1962) 
found no appreciable correlations across 
language groups between the ratings of 
many sounds on the same dimension, it 
Seems to offer at least the core of a 
reconciliation, In any case, it is quite 


Clear that there is a need for further 
exploration. 


THE Lanouacr-Hasirs THEORY 


Since it is possible that alternative 
explanations may exist at this time for 
the data of Phonetic-symbolism studies, 
Taylor’s language-habits theory of pho- 
netic symbolism deserves to be evalu- 
ated in its own right, While at first 
glance the theory seems to be a plausible 
explanation for the Taylor and Taylor 
(1962) findings, it has a number of 
difficulties which need to be considered. 
First of all, the theory as stated begs 
the question of how it comes about in 
the first place that unrelated words in 
a given language which share a con- 
notative meaning tend also to share 
structural components, It seems that 
rather than phonetic symbolism being 
the result of Such linguistic phenomena 
ed by Taylor, just the re- 
verse may be true. That is, because of 
the symbolic values acquired by sounds 
through extralinguistic associations, dif- 
ferent words which represent a meaning 
category, or refer to it connotatively, 
draw upon the Same sounds because of 
the association of those sounds to that 
Category. As it Stands, Taylor’s theory 
Xplain the observation of such 
People as Jespersen (1922) that some 
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word types, for example, words of ac- 
tion, words related to sounds, and words 
of imitation, seem to have more phonetic 
symbolism. The hypothesis of extra- 
linguistic associations, on the other 
hand, could account for it quite well. 
A second difficulty of Taylor’s theory 
is that while it proposes that phonetic 
symbolism is based upon an association 
between word meanings and the initial 
consonants of words, it offers no reason 
why initial consonants should be the 
ones to carry the burden of the proposed 
associations. Thus, Horowitz (1955) 
has pointed out that in most Indo- 
European languages it is the final sylla- 
ble or suffix which usually carries the 
adjectival component of a word. Why 
then do speakers not learn to associate 
the final sounds of a word with its con- 
notations? Furthermore, Taylor offers 
no evidence that initial consonants of a 
word in a meaning category are in fact 
more frequently similar than are any 
other components of those words. 
Third, the evidence offered by Taylor 
that a larger number of words of a given 
connotation have the same initial con- 
sonant than do words of the opposite 
connotation, is open to the very criti- 
cism of selectivity that Taylor rightly 
levels against such people as Paget 
(1930) who showed that some words 
are constructed in a manner consistent 
with what their theories would predict. 
The words which do not fit the model 
are never mentioned. Thus, to take the 
list of words offered by Taylor, it is 
true that there are many more big 
words beginning with the c sound or K 
sound (GARGANTUA, GLARING, GOODLY, 
GRAVE, GREAT, GROSS, CAPACITY, CARGO, 
COLOSSUS, COMPREHENSIVE, CONSIDER- 
ABLE, CONSUMMATE, CORPORATION, COR- 
PULENCE, CRASS, and QUANTITY) than 
there are small words having those initial 
Consonants (GLEAM, GLOBULE, GRAIN, 
GRUB, CLOSE, CONTRACT, CORPUSCLE, 


CRUMB, and CRUMBLE) and Taylor’s 
hypothesis would seem to be borne out. 
However, one can easily think of a 
sufficient number of small words begin- 
ning with G and K (e.g., CRIB, CRAWL, 
CAN, GRAM, GUN, GLASSES, and so on) 
which, if included in the list, would 
cause the numerical difference to van- 
ish, if not, in fact, to be reversed. The 
selection of words as it now stands, is 
entirely arbitrary. 

On the whole, then, the promulga- 
tion of a new theory of phonetic symbol- 
ism on the grounds put forward by 
Taylor should at least await further 
confirmation of those grounds. However, 
Taylor’s work serves well to indicate 
the necessity for further cross-cultural 
work on universal phonetic symbolism. 
In fact, more systematic study of the 
parameters of phonetic symbolism as a 
response capacity is similarly indicated. 
Unfortunately, at this time there are too 
many cases being made in defense of too 
few facts. 
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LEARNING AND PSYCHOPATHOLOGY 


FREDERICK W. HUFF2 
University of Georgia 


Experiments concerning the learning of abnormal human Ss are reviewed. 
The review includes discussion of how persons with particular types of 


psychopathology learn 


different kinds of tasks as well as the reasons 


suggested for the learning phenomena found. 


This paper is an attempt to bring to- 
gether most of the learning experiments 
using abnormal human subjects. It has 
two purposes: to provide a review of the 
literature and to bring to light seemingly 
valid generalizations. 


LEARNING OF PsycHorics 
Schizophrenics 


Paranoids. No studies were found 
which investigated exclusively the learn- 
ing of paranoid schizophrenics. Huston 
(1932), Huston and Shakow (1936), 
and Shakow (1932) have demonstrated 
that of the schizophrenic subtypes, para- 
noids are more like normals than the 
others, although they are somewhat in- 
ferior to normals. The tasks employed 
were the prod meter (Shakow, 1932), 
speed of tapping (Huston & Shakow, 
1936), and the pursuitmeter (Huston, 
1932). 

Several studies (O’Connor & Rawns- 
ley, 1959; Pfaffman & Schlosberg, 1936) 
utilizing classical conditioning have 
failed to duplicate these results. Pfaff- 
man and Schlosberg (1936), in condi- 
tioning the knee jerk, found no differ- 
ences between the subcategories of 
schizophrenia, Using eyelid condition- 
ing, O’Connor and Rawnsley (1959) 
also demonstrated that paranoids did 
not differ from other schizophrenics. 
However, Pfaffman and Schlosberg 
(1936) showed that paranoids condi- 
tioned more poorly than normals, 


A Acknowledgments are due Robert E. Tay- 
lor for his able editorial assistance. 


whereas O’Connor and Rawnsley (1959) 
found no difference. These conflicting 
results could be due to subject variables 
other than paranoia. To add to the con- 
fusion, O’Connor (1957) and O’Connor 
and Rawnsley (1959) demonstrated on 
other tasks (crossing out odd and under- 
lining even numbers and verbal condi- 
tioning, respectively) that paranoid 
schizophrenics were slow learners when 
compared with nonparanoid schizo- 
phrenics. 

That paranoids may perform uniquely 
on learning tasks is supported by the 
fact that some investigators are careful 
to exclude them when studying the 
learning of other schizophrenics (Tizard 
& Venables, 1957; Venables & Tizard, 
1956b, 1956c). Further, it has been 
shown that on a complex reaction-time 
task nonparanoid schizophrenics im- 
proved when an extraneous stimulus 
(white noise) was introduced, while 
paranoids did not (Tizard & Venables, 
1957). 

Hebephrenic and Mixed Types. In 
the Huston and Shakow studies men- 
tioned above, hebephrénic and mixed 
schizophrenics were found to be inferior 
in performance to paranoids. However, 
the relationship did not hold for classical 
conditioning (Pfaffman & Schlosberg, 
1936). 

Catatonics. Catatonic schizophrenics 
are the poorest learners among all the 
schizophrenic subtypes on the tasks em- 
ployed in the Huston and Shakow 
studies, but their conditionability was 
the same as the other subtypes (O’Con- 
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nor & Rawnsley, 1959; Pfaffman & 
Schlosberg, 1936). However, Shipley 
(1934), using GSR conditioning, indi- 
cated that catatonics conditioned better 
than normals. , 
Unspecified. This section deals with 
studies in which the type of schizo- 
phrenia is unspecified. It is likely that 
the subjects were a mixture of all types. 
Nevertheless, it would not be correct to 
think that the subjects were representa- 
tive of the total schizophrenic popula- 
tion. They were chosen on the basis of 
availability and capacity to participate. 
Also, because of the nonspecificity of the 
variables defining schizophrenia it is dif- 
ficult to compare studies. Since there 
may be differences in the performance 
of the schizophrenic subtypes on learn- 
ing tasks, the present studies are thus 
confounded in not taking this into ac- 
count. However, since with one ex- 
ception (O’Connor & Rawnsley, 1959), 
the available data indicate that all 
schizophrenics are poorer on 
tasks than normals, they per! 
only in the magnitude of the 
deficit found, 
Kent (1911), 
learning tasks, wa 
demonstrate that 
learn. Boring (191 


learning 
haps err 
learning 


using many verbal- 
s one of the first to 
schizophrenics could 


3) confirmed this, but 
found that schizophrenics were inferior 


when compared with normals. This de- 
ficiency also occurred in paired-associate 
learning (Atkinson, 1958; Hall & 
Crookes, 1951; Hull, 1917; Robinson, 
1958), but retention did not seem to be 
impaired (Hull, 1917), 

There has been some exploratory 
work on the verbal conditioning of 
schizophrenics. There are no compari- 
sons of schizophrenics with normals 
(Cohen & Cohen, 1960; Salzinger & 
Pisoni, 1958). Salzinger and Pisoni 
(1958) were able to condition “I” ang 
“we”? responses in schizophrenics using 
agreement as reinforcement. Cohen and 
Cohen (1960) tried to condition first- 


and second-person pronouns and were 
unable to do so. However, they used a 
different reinforcer (the word “good”) 
and failed to control variables known A 
affect the learning of schizophrenics; 
for example, cooperation, severity of ill- 
ness, etc. Thus, it is difficult to compare 
the two studies. More recently, Salz- 
inger and Pisoni (1961) have duplicated 
their earlier findings. À 
Reaction-time studies have been in- 
cluded here because of their obvious re- 
lationship to motor learning, and be- 
cause they show changes with prac- 
tice, and thus can be considered as 
affected by learning. (The reader is re- 
ferred to King, 1954, for a general treat- 
ment of the psychomotor aspects © 
mental disease.) Many studies (Huston 
& Shakow, 1936; Huston, Shakow, & 
Riggs, 1937; Rodnick & Shakow, 1940; 
Venables & Tizard, 1956c) have reliably 
shown that schizophrenics have a slower 
reaction time than normals, and that 
with practice little improvement occurs- 
This holds for a wide variety of reac- 
tion-time tasks, ranging from simple 
tapping, to manipulating complex 
Switches to cues provided by different 
colored lights. Generally, as intensity 
of cue stimulus increases so does speed 
of reaction time; however, Venables and 
Tizard (1956b) found that with chronic, 
long-term schizophrenics the reverse was 
true where the cue stimulus was a light- 
However, on further investigation this 
“paradoxical” effect could not be dupli- 
cated (Venables & Tizard, 1956a)- 
On the pursuitmeter, schizophrenic? 
Started at a lower initial level than nor 
mals and improved with increasingly 
smaller increments, the curves appear’ 
ing more negatively accelerated tha 
with normals (Huston, 1932; Huston 
Shakow, 1948). Peters (1956), usins 
circular pencil mazes, found schizo 
phrenics poorer in both time and erior 
The operant conditioning of arly 
motor behavior in schizophrenics is fai" 


x J 


LEARNING AND PSYCHOPATHOLOGY 461 


recent, They had a slower rate of acqui- 
sition than normals and under condi- 
tions of reinforcement they had long 
periods of nonresponding. In a few sub- 
jects the conditioned response was found 
to be extremely resistant to extinction, 
even after many hours of nonreinforce- 
ment (Lindsley, 1956). Bullock (1960) 
also indicates that schizophrenics are 
inferior to normals on operant condi- 
tioning. 

The empirical findings using the 
classical conditioning paradigm are con- 
tradictory as to the conditionability of 
the schizophrenic, especially among the 
subcategories (O’Connor & Rawnsley, 
1959; Pfaffman & Schlosberg, 1936). 
Howe (1958) found conditionability of 
psychogalvanic response (PGR) to be 
no different in schizophrenics and nor- 
mals, while Peters and Muphree (1954) 
collected evidence to the contrary. In 
a review of the relevant Russian litera- 
ture, Lynn (1963) cites findings which 
support Howe’s conclusions. z 


Depressives 


Only a few studies (Brengelmann, 
1957; Venables, 1959; Venables & 
Tizard, 1956c) have included “psychotic 
depressives,” and in these the criteria for 
selection of subjects varies. Two on re- 
action time (Venables & Tizard, 1956c; 
Venables, 1959) indicate that depres- 
sives start at a lower level than normals 
(even lower than schizophrenics), but 
that they finally achieve a level com- 
mensurate with normal subjects. In an 
investigation where the task was to re- 
produce visual patterns after a short 
exposure, error scores distributed bi- 
modally, perhaps suggesting two types 
of depressives (reactive and endoge- 
nous). Depressives were inferior to nor- 
mals on this task (Brengelmann, 1957). 


Manic-Depressives 


Manic-depressives, included in two of 
the Huston and Shakow studies (1948, 


1949), had no impairment in learning. 
On GSR conditioning they conditioned 
somewhat better than normals (Shipley, 
1934). Similar findings are reported by 
Pfaffman and Schlosberg (1936) in con- 
ditioning the knee jerk, 

Unspecified 


A number of workers (Mailloux & 
Newburger, 1941; Mednick & Lindsley, 
1958; Spence & Taylor, 1953; Taylor & 
Spence, 1954) used psychotic patients 
without any further specification as to 
subject population. Since there appears 
to be differences in the learning of dif- 
ferent types of psychotics, this kind of 
study almost evades interpretation. 

On a discrimination reaction-time 
task, psychotics were able to learn to 
discriminate between colored lights 
(Mailloux & Newburger, 1941), but be- 
cause of the heterogeneous population 
and lack of control group, it is uncertain 
just what this means. On two occasions 
of eyelid conditioning it was demon- 
strated that a psychotic group could be 
conditioned better than normals (Spence 
& Taylor, 1953; Taylor & Spence, 
1954). The reason for these findings is 
unknown. It might be that the psychotic 
group included many manic-depressive 
subjects who have been shown to condi- 
tion better than normal subjects 
(Shipley, 1934; Pfaffman & Schlosberg, 
1936), or it might be that schizophrenics 
who presumably make up the bulk of 
any psychotic group do condition better 
than normals, but this seems unlikely in 
face of the evidence already cited. Med- 
nick and Lindsley (1958) report much 
response variability in operant motor 
conditioning with psychotics. 
Variables Affecting the Learning of 
Psychotics 

The studies to be reviewed here were 
chosen because they explored, experi- 
mentally, factors which might explain 
the learning phenomena presented above. 
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Strictly theoretical papers were omitted. 
These studies were primarily with schizo- 
phrenic subjects, and thus, most of the 
variables apply to schizophrenic learn- 
ing. They might also account for the 
learning of other abnormal subjects, but 
the empirical evidence is lacking. The 
variables mentioned here do not in any 
way exhaust the theoretical possibilities. 
Interference. Three investigations 
(Bleke, 1954; Lang & Luoto, 1962; 
Mednick & Lindsley, 1958) have demon- 
strated that intrusion of irrelevant 
(wrong) responses is greater in the 
learning of psychotics and schizophrenics 
than normals. One of these (Bleke, 
1954) has gone further and shown that 
reminiscence is greater for schizo- 
phrenics than normals, thus giving in- 
direct evidence of increased interference. 
He had schizophrenics learn a temporal 
maze consisting of switches where the 
discriminative stimuli were words, and 
found that reminiscence was greater for 
schizophrenics with the poorest pre- 
morbid adjustment, but not for the 
others, A normal control group showed 
less reminiscence than those with poor 
premorbid adjustment, but not the re- 
maining schizophrenics, This suggests 
another variable, severity of illness, 
which might be a factor in the learning 
of schizophrenics; the experimental evi- 
dence on this point will be reviewed 
later. Psychotics in general made more 
irrelevant and nonpurposive responses 
than did normals on Operant motor con- 
ditioning (Mednick & Lindsley, 1958). 
Lang and Luoto (1962) tested the ver- 
bal mediation hypothesis using the 
paired-associate method, Schizophrenics 
showed more intrusions of previously 
learned words than did normals. It ap- 
pears that remote associations are greater 
in schizophrenics, although this hy- 
pothesis has not been tested directly. It 
seems that schizophrenics, and psy- 
chotics in general, are poorer in learning 


tasks than normals because of height- 
ened intrusion of irrelevant (wrong) re- 
sponses. 

E A Several workers 
(e.g., Cameron & Margaret, 1951) hayi 
suggested that schizophrenics overgen 
eralize. This does not seem to be true 
when schizophrenics are taken as a 
whole; however, it might be true for 
specific subtypes. Bender and Schilder 
(1930) found overgeneralization along 
a pitch dimension in a group of catatonic 
schizophrenics when compared with nor- 
mals, while Mednick (1955), using a 
visual stimulus, obtained no differences 
when unspecified schizophrenics were 
used. Garmezy (1952) did not find any 
differences in visual discrimination of 
normals and schizophrenics, thus sug- 
gesting no difference in stimulus gen- 
eralization. 

Responsiveness, Using several differ- 
ent measures, there is evidence that 
schizophrenics are physiologically less 
responsive than normals. The other psy- 
chotic groups have not been studied from 
this point of view, However in one 
study (Huston & Shakow, 1936) there 
was no difference in the speed of tapping 
of manic-depressives and normals. Speed 
of tapping and reaction time are both 
slower in schizophrenics than in nor- 
mals (Huston & Shakow, 1936; Huston, 
Shakow, & Riggs, 1937; Rodnick & 
Shakow, 1940; Venables & Tizard, 
1956c). Speed of tapping also varied 
according to type of schizophrenia with 
the results distributing themselves as 
they did with reaction time; that is, 
Paranoids were the fastest, catatonics 
slowest, with mixed and hebephrenic 
types intermediate (Huston & Shakow, 
1936). Also, in support of lowered re- 
sponsiveness or excitability, Tizard and 
Venables (1957) found that a simple 
increase in the level of stimulation (i- 
lumination of room) caused a significant 
increase in the reaction time of non- 
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paranoid schizophrenics. This effect was 
not obtained with paranoid schizo- 
phrenics. Peters and Murphree (1954) 
report the unconditioned PGR was less 
for schizophrenics than normals. Wino- 
kur, Guze, Stewart, Pfeiffer, Stern, and 
Hornung (1959) have even suggested 
that perhaps all abnormal subjects could 
be differentiated in terms of uncondi- 
tioned PGR, and they present evidence 
to support this. They obtained a posi- 
tive relationship between the number of 
unconditioned PGR responses given to 
an alerting stimulus (tone) and rate of 
classical conditioning. In a study of 
general physiological responsiveness, the 
schizophrenic was not different from 
normal controls (Malmo, Shagass, & 
Smith, 1951). 

Reactive Inhibition. Many workers 
(Huston & Shakow, 1936; Huston, 
Shakow, & Riggs, 1937; O’Connor, 
1957; Venables & Tizard, 1956c) have 
stated that psychotics show increasing 
work decrement and overall low level 
of responding because of heightened 
susceptibility to reactive inhibition. 
This hypothesis has been tested in two 
ways: studying disinhibition by intro- 
cucing extraneous stimuli into the 
learning process and studying reminis- 
cence. If reactive inhibition were greater 
in schizophrenics than in normals, then 
disinhibition would be expected to be 
greater. This has been obtained using 
extraneous stimuli (white noise) as a 
disinhibitor and reaction time as the 
dependent variable (Lang, 1959; Pascal 
& Swensen, 1952; Tizard & Venables, 
1957). However, other studies indicate 
that this heightened disinhibition can be 
explained in terms other than reactive 
inhibition. Karras (1962), duplicating 
Lang’s study, demonstrated that the in- 
troduction of extraneous stimuli served 
as added reinforcement; that is, the sub- 
Ject was motivated to escape the white 
noise and thus reaction time was de- 
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creased. He maintained that in Lang’s 
procedural setup the white noise was 
terminated shortly after a response was 
given, thus acting as reinforcement. In 
another study (Tizard & Venables, 
1957), a facilitating effect was obtained 
by just increasing background stimula- 
tion. Thus, the facilitating effect of the 
introduction of extraneous stimuli might 
be the result of either the escape nature 
of the task, or of increased arousal, or 
both. Further, if reactive inhibition were 
greater for schizophrenics, then reminis- 
cence would be expected to be greater. 
Several workers (O’Connor 1957; Ven- 
ables, 1959) indicate that this is not the 
case when simple motor (O’Connor, 
1957) and reaction-time tasks (Ven- 
ables, 1959) are used. 

Reinforcement. The nature of the re- 
inforcing stimulus situation is an im- 
portant variable in all learning studies, 
perhaps especially so in studies using 
abnormal subjects. Although not ex- 
amined directly, there is evidence to 
indicate that what constitutes reinforce- 
ment often varies for abnormal and nor- 
mal subjects. In a study on operant 
motor conditioning in psychotics (Linds- 
ley, 1956), normal subjects worked well 
for certain rewards whereas psychotic 
subjects did not. 

Schizophrenics might learn better un- 
der escape conditions than reward 
(Cavanagh, 1958; Cohen, 1956; Karras, 
1962). Cohen (1956) taught schizo- 
phrenics two temporal mazes by shock- 
ing the incorrect responses. Shock im- 
proved learning of the second maze only, 
and had no effect on the learning of 
normals. It is not clear if this represents 
real improvement, or only recovery from 
the initial disruptive effects of shock. If 
schizophrenics are overly responsive to 
shock they might show a learning defi- 
cit on the first maze, but improved 
learning on the second due to anapi 
tion. Schizophrenics produced more no 
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mal concepts on a concept formation test 
when a correct response, that is, a more 
“normal” concept, terminated white 
noise than when it did not (Cavanagh, 
1958). Might a reduction of stimulation 
be more reinforcing for schizophrenics 
than normals? 

In a well-controlled experiment using 
schizophrenics, Karras (1962) found 
that avoidance of a loud tone was more 
reinforcing than nonavoidance in reduc- 
ing reaction time, also, the more intense 
the tone the shorter the latency. Even 
when the tone could not be avoided, 
reaction time was somewhat reduced. 
Karras concluded that improvement was 
a function of both reinforcement and 
arousal, and that if either one were low 
or lacking, learning would be impaired. 
On the surface it appears that these find- 
ings are contradicted by studies in which 
punishment was used, Several workers 

(Aisenberg, 1957; Garmezy, 1952) have 
reported that punishment hampers 


learning, but others haye found facili- 
tation 1958; 


- Punishment and 
bject sample, so 


not occur learning is 
Meaning. It is 
meaning ascribed to 
ics differs from that 
As a matter of fact, abnormality is often 
defined in terms of the unusualness with 
which a standard stimulus is interpreted, 
It seems reasonable then that meaning 
would differentially affect learning, 
Davis and Harrington (1957) have 
shown that human stimuli disrupt prob- 
lem solving in schizophrenics. In another 
study (Gladis & Wischner, 1962), it was 
more difficult for schizophrenics to learn 


axiomatic that the 


stimuli by psychot- 
of normal persons. 


“aversive” words than neutral ones in a 
paired-associate task. One might postu- 
late that schizophrenics have difficulty 
learning because a great many stimuli 
become aversive. This could account for 
the fact that schizophrenics learn best 
under avoidance conditions. Here the 
avoidance is rewarding because many 
stimuli are aversive, 

Anxiety. Many studies show that 
clinical and manifest (test) anxiety vari- 
ously affect the course of learning many 
types of tasks (Diethelm & Jones, 1947; 
Gilberstadt & Davenport, 1960; Hunt, 
1959; Korchin & Levine, 1957; Welch 
& Kubis, 1947b). Although no investi- 
gation was found in which anxiety was 
systematically varied in psychotic pa- 
tients, it seems reasonable to assume 
that anxiety has the same effect on their 
learning as it does on that of neurotics. 
Using mixed neurotics and psychotics, 
anxiety seems to increase condition- 
ability (Gilberstadt & Davenport, 1960; 
Welch & Kubis, 1947b) but to impair 
complex learning (Diethelm & Jones, 
1947; Korchin & Levine, 1957), except 
when noxious cues are present, and then 
learning is facilitated (Rosenbaum, 
1956). Thus, anxiety has different ef- 
fects depending upon: the type of learn- 
ing task; complexity of task; and nature 
of reinforcing conditions, for example, 
avoidance, punishment, reward, etc. 
These studies used heterogeneous par 
tient groups and did separate neurotics 
from Psychotics, etc. 

Severity of Illness. There are many 
indications that severity of illness is in- 
versely related to learning performance} 
that is, the “deeper” the psychosis and 
the more chronic, the poorer the learning 
(Hall & Crookes, 1951; Mednick & 
Lindsley, 1958; Peters, 1956; Smith, 
1959). In one study (King, Merrell, 
Lovinger, & Denny, 1957) this did not 
hold, but here only acute schizophrenics 
were used, and since acute schizophren- 
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ics have been shown to be alike in learn- 
ing performance (Long, 1960) the pre- 
sumption is that they are all equally ill. 

Motivation. The poor learning of 
schizophrenics might be due in part to 
their lack of cooperation or proper moti- 
vation (Cohen, 1956; Huston & Shakow, 
1946; Kent, 1911; Tizard & Venables, 
1957). Psychotics, especially schizo- 
phrenics, have all been thought to be 
deficient in motivation, cooperation, and 
attention. Unfortunately these concepts 
have not been differentiated in opera- 
tional terms; thus, it is not possible at 
this point to decide between them. 


LEARNING OF NEuROTICS 


Few studies have been conducted con- 
cerning the learning of neurotics. This 
is probably due in part to the difficulties 
involved in obtaining a neurotic group. 
Since they are not hospitalized it is diffi- 
cult to observe them over extended 
periods of time. Also, on theoretical 
grounds there is less reason to expect 
learning deficiencies in neurotics, and 
therefore interest has been somewhat 
less than it has been in psychotics. 


Anxiety Neurotics 


It appears that anxiety neurotics are 
inferior to normals in verbal learning, 
both rote serial (Malmo & Amsel, 1948) 
and paired-associate (Hall & Crookes, 
1952). Because the subject population 
1s not clearly defined it is difficult to 
compare the results of different experi- 
ments on the conditioning of anxiety 
neurotics, Taylor and Spence (1954) 
did not find any differences in the con- 
ditioning of anxiety neurotics, other 
a tics; and normals, Yet in another 
na mient (Franks, 1956), anxiety 
C conditioned better than hys- 
ferion : have been shown to be in- 
Hilbersta orm als (Hilberstam, 1961). 
introverten ( 1961) also reported that 

neurotics condition better 


than normals. Are these anxiety neurot- 
ics? Two variables are probably con- 
fused here, psychiatric nosology and 
introversion-extraversion (cf. Eysenck, 
1962). Anxiety neurotics conditioned 
better than normals when magnitude of 
the conditioned response during extinc- 
tion was used as a measure of condition- 
ing (Howe, 1958). 


Hysterics 


Hysterics condition differently from 
normals and other neurotics (Franks, 
1956; Hilberstam, 1961). Using eye- 
blink and PGR, hysteric subjects con- 
ditioned poorer than subjects with anx- 
iety states (Franks, 1956). Another 
PGR study found hysteric subjects in- 


ferior to normal subjects (Hilberstam, 
1961). 


Depressives and Obsessive-Com pulsives 


Hall and Crookes (1952) reported 
that depressives were consistently in- 
ferior to normals on paired-associate 
learning as were obsessive-compulsives. 
They did not investigate differences be- 
tween depressives and other neurotics. 


Unspecified 


The studies reported here used neu- 
rotic subjects taken as a group without 
regard to type of neurosis. Several 
(Spence & Taylor, 1953; Taylor & 
Spence, 1954) have failed to find any 
differences in the conditioning of neu- 
rotics and normals. One was found of 
verbal learning and here “emotionally 
disturbed” boys were used (Davids & 
Oliver, 1960). Davids and Oliver used 
a paired-associate task and disturbed 
boys were inferior to normal controls. 
They also used a finger maze, but in 
contrast, the boys were superior to nor- 
mals on this task. Brengelmann (1957) 
obtained similar results using adult neu- 
rotics on a figure-reproduction task. 
Neurotics appear inferior to normals in 
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rant conditioning of gross motor be- 
havioi (Bullock, 1960). Cohen and 
Cohen (1960) have demonstrated that 
neurotics can be verbally conditioned, 
but because no control group was used it 
is unknown if they differ from normals. 


Character Disorders 


Lykken (1957) studied the condition- 
ing (GSR) of psychopaths, and found 
that primary psychopaths (chronic with 
no apparent precipitating cause) were 


significantly poorer than neurotic psy- 
chopaths and normals. 


Variables Afjecting the Learning of 
Neurotics 

The reasons advanced to explain the 
learning of neurotic subjects are similar 


to those used to explain the learning of 
psychotic subjects. Among the more 
common ones explored experimentally 
are: reactive inhibition (Malmo & 
Wallerstein, 1955), connotative mean- 
ing of words in a Paired-associate task 
(Davids & Oliver, 1960), interference 
from ' Previously learned material 
(Davids & Oliver, 1960; Malmo & 
Amsel, 1948), physiological reactivity 
(Winokur et al., 1959), and anxiety 
(Hunt, 1959), 
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